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UDP-GALACTOSE: p-A^-ACETYL- 
GLUCOSAMINE Bl.3 GALACTOSYLTRANSFERASES. P3GAL-T5 

This invention claims priority in the United States under 35 U.S.C. § 1 19 to 
Denmark AppHcation No. PA 1998 01483 filed November 13, 1998, which application is 
incorporated by reference herein in its entirety. 

1, FIELD OF THE INVENTION 

The present invention relates generally to the biosynthesis of glycans found 
as free oligosaccharides or covalently bound to proteins and glycosphingolipids. This 

10 

invention is more particularly related to a family of nucleic acids encoding UDP-D- 
galactose:p7S^-acetylglucosamine pl,3-galactosyltransferases (pSGal-transferases), which 
add galactose to the hydroxy group at carbon 3 of 2-acetamido-2-deoxy-D-glucose 
(GlcNAc). This invention is more particularly related to a gene encoding the fifth member 
of the family of p3Gal-transferases, termed p3Gal-T5, probes to the DNA encoding p3Gal- 
T5, DNA constructs comprising DNA encoding p3Gal-T5, recombinant plasmids and 
recombinant methods for producing p3Gal-T5, recombinant methods for stably transfecting 
cells for expression of p3Gal-T5, and methods for indication of DNA polymorphism in 
patients. 

20 

2. BACKGROUND OF THE INVENTION 

A family of UDP-galactose; p-7V-acetyl-glucosamine pl-3galactosyl- 
transferases (p3Gal-T*s) was recently identified (Amado, M., Almeida, R., Cameiro, P., et 
al. A family of human p3-galactosyltransferases: characterisation of four members of a 

25 UDP-galactose p-N-acetylglucosamine/p-N-acetylgalactosamine pl,3-Galactosyltransferase 
family. J. Biol Chem. 273:12770-12778, 1998; Kolbinger, F., Streiff, M.B. and 
Katopodis, A.G, Cloning of ahumanUDP-galactose:2- acetamido-2-deoxy-D-glucose 3P- 
galactosyltransferase catalysing the formation of type 1 chains, J, Biol. Chem. 273:433- 
440, 1998; Hennett, T., Dinter, A., Kuhnert, P., Mattu, T.S., Rudd, P.M. and Berger, E.G. 

3 Q Genomic cloning and expression of three murine UDP-galactose: P-N-acetylglucosamine 
pi,3-galactosyltransferase genes. J, Biol Chem. 273:58-65, 1998; Miyaki, H., Fukumoto, 
S., Okada, M., Hasegawa, T. and Furukawa, K. Expression cloning of rat cDNA encoding 
UDP-galactose G(D2) pi, 3 galactosyltransferase that determines the expression of G(D1 
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b)/G(M l)G(Al). J.BioLChem. 272:24794-24799,1997). Three genes within this family, 
p3Gal-Tl, -T2, and -T3, encode pSgalactosyltransferases that form the Gaipi-3GlcNAc 
linkage. The type 1 chain Gaipi-3GlcNAc sequence is found in both N- and O-linked 
^ oligosaccharides of glycoproteins and in lactoseries glycosphingolipids, where it is the 
counterpart of type 2 GaIpl-4GlcNAc poly-A^acetyllactosamine structures (Kobata. A. 
Structures and functions of the sugar chains of glycoproteins. EurJBiochem 209:483-501, 
1992.). Type 1 chain structures are found mainly in endodermally derived epithelia, 
whereas the type 2 chains are found in ecto- and mesodermally derived cells including 
erythrocytes (Oriol, R., Le Pendu, J. and MolUcone, R. Genetics of ABO, H, Lewis, X and 
related antigens. Vox Sanguinis 51:161-171, 1986; Clausen, H. and Hakomori, S. ABH and 
related histo-blood group antigens; immunochemical differences in carrier isotypes and 
their distribution. Vox Sanguinis 56:l-2Q, 1989). Normal gastro-intestinal epithelia express 
mainly type 1 chain glycoconjugates, while type 2 chain structures are predominantly 
^5 expressed in tumors (Hakomori, S. Abeirant glycosylation in tumors and tumor-associated 
carbohydrate antigens. Tumor malignancy defined by abeirant glycosylation and 
sphingo(glyco)lipid metabolism. Advances in Cancer Research 52:257-331, 1989; 
Hakomori, S. Tumor malignancy defined by aberrant glycosylation and 
sphingo(glyco)lipid metabolism. Cancer 56:5309-5318, 1996). It is of considerable 
20 interest to define the gene(s) responsible for formation of these core structures in normal 
and malignant epithelia. Several characteristics of the three previously described p3Gal-Ts 
capable of forming type 1 chain structures suggest that these are not the major enzyme(s) 
involved in type 1 chains synthesis in epithelia: (i) Northern analysis indicates that p3Gal- 
Tl and -T2 are exclusively expressed in brain (Amado, M., Almeida, R., Cameiro, F., et al. 
2 5 A family of human p3-galactosyltransferases: characterisation of four members of a UDP- 
galactose P-N-acetylglucosamine/p-N-acetylgalactosamine p 1 ,3-Galactosyltransferase 
family. J. Biol. Chem. 273:12770-12778, 1998; Kolbinger, F., Streiff, M.B. and Ktopodis, 
A.G. Cloning of a human UDP-galactose:2- acetamido-2-deoxy-D-glucose 3p- 
galactosyltransferase catalysing the formation of type 1 chains. J. Biol. Chem. 273:433- 
30 440, 1998; Hennett, T., Dinter, A., Kuhnert, P., Mattu, T.S., Rudd, P.M. and Berger. E.G. 
Genomic cloning and expression of three murine UDP-galactose: P-N-acetylglucosamine 
pl,3-galactosyltransferase genes. J. Biol. Chem. 273:58-65, 1998); (ii) although p3Gal-T3 
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has a wider expression pattern it is not detected in several tissues including colon and it is 
weakly expressed in gastric mucosa (Amado, M., Almeida, R., Cameiro, F., et al. A family 
of human p3-galactosyltransferases: characterisation of four members of a UDP-galactose 
p-N-acetylglucosamine/p-N-acetylgalactosamine pl,3-Galactosyltransferase family. J. 
Biol Chem. 273:12770-12778, 1998; Kolbinger, F., Streiff, M.B. and Ktopodis, A.G. 
Cloning of a human UDP-galactose:2- acetamido-2-deoxy-D-glucose 3p- 
galactosyltransferase catalysing the formation of type 1 chains. J. Biol Chem. 273:433- 
440, 1998); (iii) the kinetic properties of recombinant enzymes are not consistent with those 
reported for P3Gal"T activities in epithelia (Sheares, B.T., Lau, J.T. and Carlson, D.M. 
Biosynthesis of galactosyl-beta 1,3-N- acetylglucosamine. J, Biol Chem. 257:599-602, 
1982; Hoknes, E.H. Characterization and membrane organization of beta 1 — 3- and beta 
1 — 4- galactosyltransferases from human colonic adenocarcinoma cell lines Cob 205 and 
SW403: basis for preferential synthesis of type 1 chain lacto-series carbohydrate structures. 
Arch Biochem Biophys 270:630-646, 1989); and (iv) the acceptor substrate specificities of 
P3Gal-Tl, -T2, or -T3 do not include the mucin-type core 3 structure (Amado, M., Almeida, 
R., Cameiro, F., et al. A family of human p3-galactosyltransferases: characterisation of 
four members of a UDP-galactose p-N-acetylglucosamine/p-N-acetylgalactosamine pl,3- 
Galactosyltransferase family. J.Biol Chem. 273:12770-12778, 1998; Hennett, T., Dinter, 
A., Kuhnert, P., Mattu, T.S., Rudd, P.M. and Berger, E.G. Genomic cloning and 
expression of three murine UDP-galactose: P-N-acetylglucosamine pl,3- 
galactosyltransferase genes. J. Biol Chem. 273:58-65, 1998), which was previously found 
to be a highly efficient substrate for p3Gal-T activity isolated from porcine trachea 
(Sheares, B.T. and Carlson, D.M. Characterization of UDP-galactose:2-acetamido-2- 
deoxy-D- glucose 3 beta-galactosyltransferase from pig trachea. X Biol Chem, 258:9893- 
9898, 1983). 

Access to additional existing pGlcNAc p3Gal-transferase genes encoding 
p3Gal-transferases with better kinetic properties than p3Gal-Tl, -T2, and -T3 would allow 
production of more efficient enzymes for use in galactosylation of oligosaccharides, 
glycoproteins, and glycosphingolipids. Such enzymes could be used, for example, in 
pharmaceutical or other commercial applications that require synthetic galactosylation of 
these or other substrates that are not or poorly acted upon by p3Gal-Tl , -T2, and -T3, in 
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order to produce appropriately glycosylated glycoconjugates having particular enzymatic, 
immunogenic, or other biological and/or physical properties. 

Consequently, there exists a need in the art for additional isolated UDP- 
^ galactose: P-A^-acetyl-glucosamine pl-3Galactosyltransferases having unique, specific 
properties and the primary stmcture of the genes encoding these enzymes. The present 
invention meets this need, and further presents other related advantages, as described in 
detail below. 

10 3- SUMMARY OF THF TNVENTION 

The present invention provides isolated nucleic acids encoding human UDP- 
galactose: p3-N-acetylglucosamine pi,3-galactosyltransferase (P3Gal-T5), including cDNA 
and genomic DNA. p3Gal-T5 has better kinetic properties than P3GaI-Tl , -T2. and T3, as 
exemplified by its better activity with saccharide derivatives and glycoprotein substrates as 
well as its activity with globosideglycolipid. Indeed, p3Gal-T5 is the first 
glycosyltransferase available for transfer of Gal pl-3 to globoside (GaINAcpi-3Galal- 
4Gaipi-4Glcpi-Cer). The complete nucleotide sequence of p3Gal-T5, is set forth in Figure 

In one aspect, the invention encompasses isolated nucleic acids comprising 
2 0 or consisting of the nucleotide sequence of nucleotides 1-933 as set forth in Figure 1, or 
sequence-conservative or fimction-conseivative variants thereof Also provided are isolated 
nucleic acids hybridizable with nucleic acids having the sequence as set forth in Figure 1 or 
fragments thereof or sequence-conservative or function-conservative variants thereof In 
various embodiments, the nucleic acids of the invention are hybridizable with P3Gal-T5 
2 5 sequences under conditions of low stringency, intermediate stringency, high stringency, or 
specific preferred stringency conditions defined herein. In one embodiment, the DNA ' 
sequence encodes the amino acid sequence, as set forth in Figure 1. fi-om methionine (amino 
acid no. 1) to valine (amino acid no. 3 10). In another embodiment, the DNA sequence 
encodes an amino acid sequence comprising a sequence from methionine (no. 25) to valine 
30 (no. 3 1 0) as set forth in Figure 1 . 

In a related aspect, the invention provides nucleic acid vectors comprising 
P3Gal-T5 DNA sequences, including but not limited to those vectors in which the p3Gal- 

- 4 - 
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T5 DNA sequence is operably linked to a transcriptional regulatory element (e.g. a 
promoter, an enhances, or both) , with or without a polyadenylation sequence. Cells 
comprising these vectors are also provided, including without limitation transiently and 
stably expressing cells. Viruses, including bacteriophages, comprising p3Gal-T5-derived 

5 

DNA sequences are also provided. The invention also encompasses methods for producing 
P3Gal-T5 polypeptides. Cell-based methods include without limitation those comprising: 
introducing into a host cell an isolated DNA molecule encoding P3Gal-T5, or a DNA 
construct comprising a DNA sequence encoding P3Gal-T5; growing the host cell under 
conditions suitable for p3Gal-T5 expression; and isolating p3Gal-T5 produced by the host 
cell. Fxirther, this invention provides a method for generating a host cell with de novo stable 
expression of p3Gal-T5 comprising: introducing into a host cell an isolated DNA molecule 
encoding P3Gal-T5 or an enzymatically-active fragment thereof (such as, for example, a 
polypeptide comprising amino acids 25-310 as set forth in Figure 1), or a DNA construct 
comprising a DNA sequence encoding P3Gal-T5 or an enzymatically active fragment 
thereof; selecting and growing host cells in an appropriate medium; and identifying stably 
transfected cells expressing p3Gal-T5. The stably transfected cells may be used for the 
production of P33Gal-T5 enzyme for use as a catalyst and for recombinant production of 
peptides or proteins with appropriate galactosylation. For example, eukaryotic cells, 

2 Q whether normal or diseased cells, having their glycosylation pattern modified by stable 

transfection as above, or components of such cells, may be used to deliver specific 
glycoforms of glycopeptides and glycoproteins, such as, for example, as immunogens for 
vaccination. 

In yet another aspect, the invention provides isolated P3Gal-T5 polypeptides, 
2^ including without limitation polypeptides having the sequence set forth in Figure 1, 
polypeptides having the sequence of amino acids 25-310 as set forth in Figure 1, and a 
fusion polypeptide consisting of at least amino acids 25-310 as set forth in Figure 1 fused in 
frame to a second sequence, which may be any sequence that is compatible with retention of 
P3Gal-T5 enzymatic activity in the fusion polypeptide. Suitable second sequences include 

3 Q without limitation those comprising an affinity ligand, a reactive group, and/or a functional 

domain from another protein. 

- 5 - 
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In another aspect of the present invention, methods are disclosed for 
screening for mutations in the coding region (exon I) of the p3Gal-T5 gene using genomic 
DNA isolated from, e.g., blood cells of normal and/or diseased subjects. In one 
g embodiment, the method comprises: isolation of DNA from a normal or diseased subject; 
PGR amplification of coding exon I; DNA sequencing of amplified exon DNA fragments 
and establishing therefrom potential structural defects of the p3Gal-T5 gene associated with 
disease. 

These and other aspects of the present invention will become evident upon 
^ Q reference to the following detailed description and drawings. 

4- BRIEF DES CRIPTION OF THE PR AWTIVriR 

FIG. 1 depicts the DNA sequence of the p3Gal-T5 gene (SEQ ID NO:8) and 
the predicted amino acid sequence of p3Gal-T5 (SEQ ID NO:9). The amino acid sequence 
is shown in single-letter amino acid code. The hydrophobic segment representing the 
putative transmembrane domain is underlined with a double line (Kyte & Doolittle, window 
of 8 (Kyte, J. and Doolittle, R.F. A simple method for displaying the hydropathic character 
ofaprotein. Journal of Molecular Biology ISIMS-Ul, 1982)). Three consensus motifs 

2 0 for ^-glycosylation are indicated by asterisks. The location of the primers used for 

preparation of the expression constructs are indicated by single underlining The single- 
letter amino acid code corresponds to the three-letter amino acid code of the Sequence 
Listing set forth hereinbelow, as follows: A, Ala; R, Arg; N, Asn; D. Asp; B, Asx; C, Cys; 
Q, Ghi; E, Glu; Z, Glx; G. Gly; H, His; I, He; L, Leu; K, Lys; M, Met; F, Phe; P, Pro; S, 

25 Ser; T, Thr; W. Trp; Y, Tyr; and V, Val. 

FIG. 2 is an illustration of multiple sequence analysis (ClustalW) of five 
human P3Gal-transferases. The transferases are listed according to order of similarity with 
p3GaI-Tl. The SEQ ID NOs for the transferases shown are as follows: p3Gal-Tl (SEQ ID 
30 N0:1 1), p3Gal-T2 (SEQ ID NO:10), p3Gal-T3 (SEQ ID NO:12), P3Gal-T4 (SEQ ID 

NO: 13) and P3Gal-T5 (SEQ ID NO:9). Introduced gaps are shown as hyphens, and aligned 
identical residues are boxed (black for all sequences, dark grey for four sequences, and light 

- 6 - 
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grey for three sequences). The putative transmembrane domains are underlined with a 
single line. The positions of conserved cysteines are indicated hy asterisks. One conserved 
A^-glycosylation site is indicated by an open circle. The DxD motif is indicated by an 
arrow. 

5 

FIG, 3 is a schematic depiction of pSGal-transferases aligned for the 
conserved cysteine residues. Potential A^-glycosylation sites are indicated by trees. 
Cysteine residues are indicated by the letter C, and conservation of cysteines are indicated 
by stippled lines between genes. The position of conserved sequence motifs as shown in 
Figure 2 are indicated with dotted lines and amino acid sequences. The putative 
transmembrane signal is indicated by thick lines, 

FIG. 4 depicts sections of a 1-D ^H-NMR spectrum of the p3Gal-T5 product 
. _ with Core3-/?NPh, Galpl-3GlcNAcpl-3GalNAcal-lj9NPh, showing all non-exchangeable 
monosaccharide ring methine and exocyclic methylene resonances. Residue designations 
for the Gaipi-^3(Galp3), GlcNAcPl-3 (GlcNAcp3), GalNAcal-l(a) are followed by 
proton designations (Braunschweiler, L. and Ernst, R.R. Coherence transfer by isotropic 
mixing: Application to proton correlation spectroscopy. J, Mag?i, Reson, 53:521-528, 1983; 
2Q Bax, A. and Davis, D.G. MLEV-1 7-based two-dimensional homonuclear magnetization 
transfer spectroscopy. J, Magn, Reson. 65:355-360, 1985a; Bothner-By, A.A., Stephens, 
R.L., Lee, J.M., Warren, CD. and Jeanloz, R.W. Structure determination of a 
tetrasaccharide: Transient nuclear Overhauser effects in the rotating frame. J,Am, Chem. 
Soc 106:811-813, 1984; Bax, A. and Davis, D.G. Practical aspects of two-dimensional 
25 transverse NOE spectroscopy. J, Magn. Reson. 63:207-213, 1985b; Keeler, J., Laue, E.D. 
and Moskau, D. Experiments for recording pure-absorption heteronuclear correlation 
spectra using pulsed field gradients. J. Magn, Reson. 98:207-216, 1992; Bodenhausen, G. 
and Ruben, D.J. Natural abundance nitrogen-15 NMR by enhanced heteronuclear 
spectroscopy. Chem. Phys, Lett, 69:185-189, 1980). 

30 

FIG. 5 is a photographic illustration of Northem blot analysis of human 
tumor cell lines. Human pancreatic adenocarcinoma cell lines AsPC-1, BxPC-3, Capan-1, 
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Capan-2, Colo357, HPAF, PANC-1, Suit2. S2-013, and the HT29 colon adenocarcinoma 
cell line were probed with "P-labeled cDNA of p3Gal-T5 corresponding to the soluble 
expression construct. 

5 

5- DETAILE D DESCRTPTION OF THE INVENTION 

All patent applications, patents, and literature references cited in this 
specification are hereby incorporated by reference in their entirety. In the case of conflict, 
the present description, including defmitions, is intended to control. 

10 

5.1. DEFINITIONS 

1 . "Nucleic acid" or "polynucleotide" as used herein refers to purine- 
and pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. This includes 
single-and double-stranded molecules, i.e.. DNA-DNA, DNA-RNA and RNA-RNA 
hybrids, as well as "protein nucleic acids" (PNA) formed by conjugating bases to an amino 
acid backbone. This also includes nucleic acids containing modified bases (see below). 

2. "Complementary DNA or cDNA" as used herein refers to a DNA 
molecule or sequence that has been enzymatically synthesized fi-om the sequences present 
2 0 3n niRNA template, or a clone of such a DNA molecule. A "DNA Construct" is a DNA 
molecule or a clone of such a molecule, either single- or double-stranded, which has been 
modified to contain segments of DNA that are combined and juxtaposed in a manner that 
would not otherwise exist in nature. By way of non-limiting example, a cDNA or DNA 
which has no introns is inserted adjacent to, or within, exogenous DNA sequences. 
25 3. A plasmid or, more generally, a vector, is a DNA construct 

containing genetic information that may provide for its replication when inserted into a host 
cell. A plasmid generally contains at least one gene sequence to be expressed in the host 
cell, as well as sequences that facilitate such gene expression, including promoters and 
transcription initiation sites. It may be a linear or closed circular molecule. 
30 4. Nucleic acids are "hybridizable" to each other when at least one 

strand of one nucleic acid can anneal to another nucleic acid under defined stringency 
conditions. Stringency of hybridization is determined, e.g., by a) the temperature at which 

- 8 - 
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hybridization and/or washing is performed, and b) the ionic strength and polarity (e.g., 
formamide) of the hybridization and washing solutions, as well as other parameters. 
Hybridization requires that the two nucleic acids contain substantially complementary 
sequences; depending on the stringency of hybridization, however, mismatches may be 
tolerated. Typically, hybridization of two sequences at high stringency (such as, for 
example, in an aqueous solution of 0.5X SSC, at 65 ""C) requires that the sequences exhibit 
some high degree of complementarity over their entire sequence. Conditions of 
intermediate stringency (such as, for example, an aqueous solution of 2X SSC at 65 ""C) and 
low stringency (such as, for example, an aqueous solution of 2X SSC at 55 °C), require 
correspondingly less overall complementarily between the hybridizing sequences. (IX SSC 
is 0.15 M NaCl, 0.015 M Na citrate.) 

In one embodiment, this invention provides nucleic acids which are 
hybridizable to a p3Gal-T5 nucleic acid imder the following hybridization conditions: a 
full-length or soluble p3Gal-T5 expression construct (see Examples) is used as probe (e.g. 
by random primed labeling) against a DNA or RNA blot, the blot is probed overnight at 
42 ''C as previously described (Bennett et al., 1996, cDNA cloning and expression of a 
novel human UDP-N-acetyl-alpha-D-galactosamine, Polypeptide N-acetyl-galactosaminyl- 
transferase, GalNAc-T3, J. Biol Chem. 271, 17006-17012), washed 2x10 min at room 
temperature (RT; from 1 8 to 23 °C) with 2 x SSC, 1 % Na4P202 , 2 x 20 min at 65 °C with 
0.2 X SSC, 1 % SDS, 1% Na4P202 and once 10 min with 0.2 x SSC at RT ("preferred 
hybridization conditions")- Under these preferred hybridization conditions, there is no 
cross-hybridization between p3Gal-T5 and the previously-identified P3Gal-Ts (i.e. p3Gal- 
Tl, -T2, -T3, and T4; see also Amado et al., 1998, A family of human P3- 

25 galactosyltransferases: characterization of four members of a UDP -galactose p-N- 

acetylglucosamine/p-N-acetylgalactosamine pl,3-Galactosyltransferase family, J. Biol 
Chem, 273, 12770-12778). 

5. An "isolated" nucleic acid or polypeptide as used herein refers to a 
component that is removed from its original environment (for example, its natural 

3 Q environment if it is naturally occurring). An isolated nucleic acid or polypeptide contains 
less than about 50%, preferably less than about 75%, and most preferably less than about 
90%, of the cellular components with which it was originally associated. 
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6. A "probe" refers to a nucleic acid that forms a hybrid structure with a 
sequence in a target region due to complementarily of at least one sequence in the probe 
with a sequence in the target region. 

5 7. A nucleic acid that is "derived from" a designated sequence refers to 

a nucleic acid sequence that corresponds to a region of the designated sequence. This 
encompasses sequences that are homologous or complementary to the sequence, as well as 
"sequence-conservative variants" and "function-conservative variants". Sequence- 
conservative variants are those in which a change of one or more nucleotides in a given 
codon position results in no alteration in the amino acid encoded at that position. Function- 
conservative variants of p3Gal-T5 are those in which a given amino acid residue in the 
polypeptide has been changed without altering the overall conformation and enzymatic 
activity (including substrate specificity) of the native polypeptide; these changes include, 
but are not limited to, replacement of an amino acid with one having similar physico- 
chemical properties (such as, for example, acidic, basic, hydrophobic, and the like). 

8. A "donor substrate" is a molecule recognized by, e.g., a 
galactosyltransferase and that contributes a galactosyl moiety for the transferase reaction. 
For P3GaI-T5, a donor substrate is UDP-galactose. An "acceptor substrate" is a molecule, 
preferably a saccharide or oligosaccharide, that is recognized by, e.g., a galatosyltransferase 

2 0 and that is the target for the modification catalyzed by the transferase, i.e., receives the 

galatosyl moiety. For P3Gal-T5, acceptor substrates include without limitation 
oligosaccharides, glycoproteins, O-linked GIcNAc-glycopeptides, O-linked GalNAc- 
glycopeptides, and glycosphingolipids containing the sequences, GlcNAcpi-6Gal, 
GlcNAcPl-6GalNAc, GlcNAcpi-3 GalNAc, GlcNAcpl -2Man, GlcNAcpi-4Man, 
25 GlcNAcpi-6Man, GlcNAcpi-3Man, Glcpl-ceramide, and GalNAcpi-3Gal. 

The present invention provides the isolated DNA molecules, includmg 
genomic DNA and cDNA, encoding the UDP-galactose: P-^-acetylglucosamine pi,3- 
galactosyltransferase (p3Gal-T5). 

p3Gal-T5 was identified by analysis of EST database sequence information, 

3 0 and cloned based on EST and 5 'RACE cDNA clones. The cloning strategy may be briefly 

summarized as follows: 1) synthesis of oligonucleotides derived fi-om EST sequence 
information, designated EBER1301 and EBER 1302; 2) PGR screening and isolation of a 
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PI genomic DNA phage containing the entire coding region of p3Gal-T5; 3) sequencing of 
PI DNA; 4) identification of a novel DNA sequence corresponding to P3Gal-T5; 5) 
construction of expression constructs by reverse-transcription-polymerase chain reaction 
(RT-PCR) using human PI DNA; 6) expression of the cDNA encoding p3Gal-T5 in Sf9 
(Spodoptera frugiperda) cells. More specifically, the isolation of a representative DNA 
molecule encoding a novel fifth member of the mammalian UDP-galactose: p-7V- 
acetylglucosamine/p-A^-acetylgalactosamine pi,3-galactosyltransferase family involved the 
following procedures described below. 

5.2. IDENTIFICATION AND CLONING OF HUMAN B3Gal-T5 

A novel gene, with significant sequence similarity to the p3Gal-transferase 
gene family was identified (Fig 1), using the strategy as previously described (Almeida, R., 
Amado, M., David, L., et al. A Family of Human p4-Galactosyltransferases: Cloning and 
expression of two novel UDP-Galactose p-N-Acetylglucosamine pi,4-Galactosyl- 
transferases, p4Gal-T2 and p4Gal-T3. XBiolChem. 272:31979-31992,1997). The 
predicted coding region of P3Gal-T5 included two potential initiation codons, preceding a 
hydrophobic sequence, of which the second is in agreement with Kozak's rule (Kozak, M. 
Regulation of translation in eukaryotic systems. Ann Rev Cell Biol 8:197-225, 1992) (Fig. 
1). The predicted coding sequence indicates that p3Gal-T5 is an type II transmembrane 
glycoprotein with a N-terminal cytoplasmic domain of 2 or 7 residues, a transmembrane 
segment of 19 residues flanked by charged residues, and a stem region and catalytic domain 
of 284 residues with three potential N-glycosylation sites (Fig. 1). A Kyte and Doolittle 
hydropathy plot (Kyte, J. and Doolittle, R.F. A simple method for displaying the 
hydropathic character of a protein. Journal of Molecular Biology 157:105-132, 1982) 
indicated that the putative stem region was hydrophilic similar to pGal-Tl, -T2 and -T3 
(Amado, M., Almeida, R., Cameiro, F., et al. A family of human p3-galactosyltransferases: 
characterization of four members of a UDP-galactose p-N-acetylglucosamine/p-N- 
acetylgalactosamine pi, 3-Galactosyltransferase family. J.Biol Chem. 273:12770-12778, 
1998). In contrast, P3Gal-T4, with exclusive glycolipid specificity has a hydrophobic stem 
region (Amado, M., Almeida, R., Cameiro, F., et al. A family of human p3- 
galactosyltransferases: characterisation of four members of a UDP-galactose P-N- 
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acetylglucosamine/p-N-acetylgalactosamine p 1 ,3-Galactosyltransferase family. J. Biol. 

Chem. 273:12770-12778, 1998; Miyaki, H., Fukumoto, S.. Okada, M.. Hasegawa, T. and 

Furukawa, K. Expression cloning of rat cdna encoding UDP-galactose G(D2) pi,3 
^ galactosyltransferase that determines the expression of G(D1 b)/G(M 1)G(A1). J. Biol. 

Chem. 272:24794-24799, 1997). 

A multiple sequence alignment of five p3Gal-transferases is shown in Figure 

2. The P3Gal-T5 gene has highest similarity to p3Gal-T2. Similarities among the five 
himian genes are found predominantly in the central regions; there were no significant 
similarities in the NHj-terminal regions. Several motifs in the putative catalytic domains 
are conserved between all the sequences. Noteworthy, three cysteine residues are aligned 
within all the human genes, and three additional are aligned within P3Gal-Tl, -T2, -T3 and 
-T5 (Fig. 2, Fig. 3). One potential N-linked glycosylation site, occurs in the central region 
of the putative catalytic domains, and is conserved in all sequences. Similarly, a single N- 
linked glycosylation site was conserved among all members of p4Gal-T gene family 
(Schwientek, T., Almeida, R., Levery, S.B.. Hobnes, E., Bennett, E.P. and Clausen, H. 
Cloning of a novel member of the UDP-galactose: P-N-acetylglucosamine pi ,4- 
galactosyltransferase family, p4Gal-T4, involved in glycosphingolipid biosynthesis. J. Biol 
Chem. 273:29295-29305, 1998 Schwientek et al., 1998). The DXD motif, recently shown 
20 to be conserved among several glycosyltransferases gene families (Wiggins, C.A.R. and 
Munro, S. Activity of the yeast MNNlalfa-l,3-mannosyltransferase requires a motif 
conserved in many other families of glycosyltransferases. Proc. Natl. Acad. Sci. USA 
95:7945-7950, 1998; Breton, C, Bettler, E., Joziasse, D.H., Geremia, R.A. and Imberty. A. 
Sequence-fimction relationships of prokaryotic and eukaryotic galactosyltransferases. J 
25 Biochem 123:1000-1009, 1998), is also present in all human p3Gal-transferases. 

5-3. GENOMIC ORGANIZATION AND CHROMOSOMAL 
LOCALIZ ATION OF P3GAL-T5. BGALT'i 
The coding region of p3GaI-T5 was determined by sequencing of PI clones 
30 to be located in a single exon, similar to pOal-Tl, -T2, -T3 and -T4 (Amado, M., Almeida, 
R., Cameiro, F., et al. A family of human P3-galactosyltransferases: characterisation of 
four members of a UDP-galactose P-N-acetylglucosamine/p-N-acetylgalactosamine pl,3- 
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Galactosyltransferase family. J, Biol Chem, 273:12770-12778,1998). This was 
confirmed in a recently released 164 kb genomic sequence (GenBank accession number 
AF064860). 5GvlLr5 is located on chromosome 21 q22.3. The other three genes in the 
family are located on different chromosomes BGALT2 (IqSl), -T3 (3q25), and -TV (6p21 .3) 
(Amado, M., Almeida, R., Cameiro, F., et al. A family of human p3-galactosyltransferases: 
characterisation of four members of a UDP-galactose p-N-acetylglucosamine/p-N- 
acetylgalactosamine pi, 3-Galactosyltransferase family. J.Biol Chem. 273:12770-12778, 
1998). 

10 

5.4. EXPRESSION OF P3GAL-T5 IN INSECT CELLS 

Expression of a soluble constmct of p3Gal-T5 in Sf9 cells resulted in a 
marked increase (20-30 fold) in galactosyltransferase activity using acceptor substrates 
containing terminal PGlcNAc, when compared to uninfected cells or cells infected with 
irrelevant constructs (not shown). Analysis of the substrate specificity of partially purified 

15 

p3Gal-T5 activity showed that all effective substrates contained pGlcNAc at the 
nonreducing end (Table I). 
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Table I Substrate specificity of j33Gal-T5 with saccharide acceptors 



10 



ouosu alC concentration 




i mM 5 mM 






nmol/min/ml 


D-GIcNAc 


0.5 


1.2 


p-D-GlcNAc-Bzl ^ 


1.5 


3.9 


P-D-Gl cNAc-l-/?-Nph 


2.1 


6.9 


P-D-G 1 cNAc- 1 -thio-/?-Nph 


1.1 


3.4 


p-D-GlcNAc-Me-umb 


2.6 


7.5 


p-D-GalNAc-Me-umb 


0.0 


0.0 


a-D-GlcNAc-Bzl 


0.0 


0.0 


a-D-GalNAc-Bzl 


0.0 


0.0 


a-D-Gal-l-o-Nph 


0.0 


0.0 


p-D-Gal-l-o-Nph 


0.0 


0.0 


p-D-Glc-Me-umb 


0.0 


0.0 


p-GaU(l-4)-P-D-Xyl-l-Me.umb ^ 


0.0 


0.0 


p-u-*ji CJN Ac-( 1 -3 >p-D-Cjal- 1 -Me 


27.4 


87.4 


p-D-GlcNAc.(l-3)-a-D-GalNAc-/>-Nph 


10.8 


34.4 


p-D-GlcNAc-( l-6)-a-D-Man-l-Me 


4.0 


13.0 


p-D-GlcNAc-( 1 -2)-a-D-Man 


0.0 


3.0 


p-D-GlcNAc-( 1 -2)-a-D-Man-(l-3).[P-D- 






GlcNAc-( l-2)-a-D-Man-( l-6)-]D-Man 


0.0 


0.0 



^ Enzyme partially purified as described elsewhere herein. 

Bzl, benzyl; Me, methyl; Me-Umb, 4-methyl-umbelliferyl; Nph, nitrophenyl. 
prepared enzymatically using -p-D-Xyl-l-Me-Umb and UDP-Ga! as substrates 
(R. Almeida, H. Clausen, unpublished). 

Among the simple saccharide derivatives tested disaccharide P-D-GlcNAc-(l 
3 0 -3)-P-D-Gal-l-Me was better than all other saccharide derivatives. This in contrast to 
p3Gal-Tl and -T2 which had very low relative activities with disaccharides used as 
substrates (Amado, M., Ahneida, R., Cameiro, F., et al. A family of human p3- 
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galactosyltransferases: characterisation of four members of a UDP-galactose P-N- 
acetylglucosamine/p-N-acetylgalactosamine pl,3-Galactosyltransferase family. J. Biol 
Chem, 273:12770-12778, 1998; Kolbinger, F., Streiff, M.B. and Ktopodis, A.G. Cloning of 
a human UDP-galactose:2- acetamido-2-deoxy-D-glucose 3 P-galactosyl transferase 

5 

catalysing the formation of type 1 chains. J. Biol Chem, 273:433-440, 1998; Hennett, T., 

Dinter, A., Kuhnert, P., Mattu, T.S., Rudd, P.M. and Berger, E.G. Genomic cloning and 

expression of three murine UDP-galactose: p-N-acetylglucosamine P13- 

galactosyltransferase genes. J, Biol Chem, 273:58-65, 1998). p3Gal-T5 showed poor 

activity with saccharide derivatives representing N-linked core structures, namely P-D- 

GlcNAc-(l-6)-a-Man-l-Me, biantennary pentasaccharide and P-D-GlcNAc-(l-2)-a-D-Man. 

Particularly striking was a high relative activity towards P-D-GlcNAc(l-3)-a-D-GalNAc-l- 

/?-Nph, which represents the core 3 O-linked structure. A comparison of relative activities 

of several p3- and p4Gal-transferases with core 3 and core 2 O-linked structures is 

- _ presented in Table II. 
Id 

Table II Activities with mucin-type core 2 and 3 acceptors 



10 



pGlcNAc- pGlcNAc(l,3)aGAlNAc-p- pGlcNAc(l,6)[pGal(l,3)]aGaINAc-p-Nph 
Bzl** Nph 

0.2mM 2inM 0.2niM 2mM 

nmol/min nmol/min nmol/min 



25 



p3GaI.Tl^ 


0.03 


0.0 


(0.0) 


0.0 


(0.0) 


0.0 


(0.0) 


0.0 


(0.0) 


p3Gal-T2 


0.04 


0.0 


(o;o) 


0.0 


(0.0) 


0.0 


(0.0) 


0.0 


(0.0) 


P3Gal-T5 


0.1 


0.3 


(0.3) 


0.2 


(2.0) 


0.0 


(0.0) 


0.01 


(0.1) 


P4Gal-T2 


0.03 


0.2 


(0.6) 


0.01 


(0.3) 


0.03 


(1.0) 




NA 


P4GaI-T3 


0.03 


0,1 


(0.3) 


0.03 


(1.0) 


ND 






ND 



^ Enzyme partially purified as described elsewhere herein. 

^ P-D-GlcNAc-Bzl was used at 80mM with p3Gal-Tl and p3Gal-T2; at 20mM with P3Gal-T5; at 0,25 

30 mM with p4Gal-T2 and at 2inM with P4Gal-T3. ND, not determined. NA. not applicable due to 

inhibition. (), ratio between values obtained with core 2 or 3 and those obtained with PGlcNAc-Bzl. 
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None of the P3Gal-Ts utilize the core 2 substrate and only P3Gal-T5 
catalyzed glycosylation of core 3 substrates. The two p4Gal-Ts tested showed lower 
activity than p3Gal-T5 with the core 3 substrate, however, direct comparison is not 
^ possible. Nevertheless, type 1 chain structures are found on core 3 (van Halbeek, H., 
Borland, L., Vliegenthart, J.F.G.. et al. Primary-structure determination of fourteen neutral 
oligosaccharides derived from bronchial-mucus glycoproteins of patients suffering from 
cystic fibrosis, employing 500-MHz IH-NMR spectroscopy. Eur J Biochem 1-20, 1982), 
but to the best of our knowledge core 2 structures are always extended with type 2 chain N- 
^ ^ acetylactosamine chains . 

Analysis of P3Gal-Ts with glycoprotein acceptors (Table HI) showed that P3 
Gal-T5 only used bovine submaxillary mucin which carries approximately 10% GlcNAc 
terminating core 3 O-linked glycans (Martensson, S., Leveiy, S.B., Fang, T. and Bendiak, 
B. Neutral core oligosaccharides of bovine submaxillary mucin. Use of lead tetraacetate in 
the cold for establishing branch positions. Eur. J. Biochem. 258. 603-622, 1998). 

Table III Substrate specificity of J33galactosyltransf erases with glycoprotein acceptors 



20 



25 



AcccDtor substrate^ 


B3Gal-Tl 




B3Gal-T2 




B3Gal-T5 




mmol/min 




mmol/min 




mmol/min 


P-D-GlcNAc-Bzl 


0.03 




0.04 




0.1 


Hen egg albumin 

Asiaio-agalacto-fetuin 

Bovine submaxillary 
mucin 


0.0 

0.01 

0.0 


(0.0) 
(0.3) 
(0.0) 


0.02 
0.07 
0.0 


(0.5) 
(1.8) 
(0.0) 


0.0 (0.0) 
0.0 (0.0) 
0.04 (0.4) 


Orosomucoid 


0.0 


(0.0) 


0.0 


(0.0) 


0.0 (0.0) 



p-o-GIcNAc-Bzl was used at SOmM for P3Gal-Tl and p3Gal-T2 and at 20mM for p3Gal-T5- ( ) 
ratio between values obtained with glycoproteins and those obtained with P.-D -GIcNAc-Bzl.' ' 

As reported previously and in the present study P3 Gal-T2 utiUzed 
glycoproteins with N-linked glycans while p3Gal-Tl showed no or very low activity with 
30 glycoprotein acceptors (Amado, M., Ahneida, R., Cameiro, F.. et al. A family of human 
P3-galactosyltransferases: characterisation of four members of a UDP-galactose P-N- 
acetylglucosamine/p-N-acetylgalactosamine pi,3-GaIactosyltransferase family. J. Biol. 
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Chem. 273:12770-12778, 1998) (Table III). A similar differential specificity for 
glycoproteins is found among p4Gal-transferases, where P4Gal-Tl, -T2, and -T3 catalyze 
glycosylation of to N-linked glycoproteins, but a novel member, P4Gal-T4, appears to be 
inactive with these substrates (Schwientek, T., Almeida, R., Levery, S.B., Holmes, E., 
Bennett, E.P. and Clausen, H. Cloning of a novel member of the UDP-galactose: P-N- 
acetylglucosamine pi ,4- galactosyltransferase family, p4Gal-T4, involved in 
glycosphingolipid biosynthesis. J.BioIChem, 273:29295-29305,1998). 

Analysis of the catalytic activities with a panel of glycolipid substrates 
revealed that p3Gal-T5 has high activity with GlcNAcpi-3Gaipi-4Glcpi-Cer (Lc3), in 
either taurodeoxycholate or Triton CF-54 (Table IV). 
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' Table IV Substrate specificities with glycolipid acceptors 



p3Gal-T5^ 





Acceptor substrate 


TDOC* 


Triton CF-54 
fimol/h/mg 


GlcCer 


(Glcpl-Cer) 


0.04 


ND 


LacCer 


(Gaipi-4Glcpi-Cer) 


ND 


ND 




(Gal« 1 -4Gaip 1 -4GlcP 1 -Cer) 


ND 


ND 




(GalNAcp 1 -3Gala 1 -4Gaip 1 -4Glcp 1 -Cer) 


0.6 


0.09 


Gg3 


(GalNAcP 1 -4Galp 1 -4GlcP 1 -Cer) 


0.09 


0.005 




(GalNAcP 1 -4 (NeuAca2-3)Ga]p 1 -4GlcP 1 -Cer) 


ND 


ND 


GM, 


(Galp 1 -3GalNAcp 1 -4(NeuAca2-3)Gaip 1 -4Glcp 1 - 
Cer) 


ND 


ND 


Lcj 


(GlcNAcP 1 -3Gaip 1 -4GlcP 1 -Cer) 


4.4 


3.6 


nLc^ 


(Galp 1 -4GlcNAcP 1 -3Galp 1 -4GlcP 1 -Cer) 


ND 


ND 


nLCj 


(GlcNAcp 1 -3GaIp 1 -4GlcNAcpi -3Galp 1 -4GlcP 1 - 
Cer) 


1.6 


0.4 



En2yme partially purified as described elsewhere herein and the specific protein 
concentration was estimated by SDS-PAGE to be 300|ag/ml. 

Assays were performed using lOO^g of taurodeoxycholate (TDOC) or Triton CF-54/lOOnl 
of reaction mixture. ND, not detectable. 



^ 5 Activity was also fovind with nLCj but this was ahnost 3-fold lower than with 

1x3, and activity was significantly lower in Triton CF-54. Interestingly, considerable 
activity was observed with Gb4 and there were detectable incorporation into GlcCer and 
Ggj. The product formed with Gb^ was characterized and found primarily to represent the 
expected Galpl-3Gb4 structure. The apparent Km of p3Gal-T5 for Lc3Cer in the presence 

30 taurodeoxycholate was approximately 2 jiM, but due to substrate inhibition this result was 
only based on data points at low concentrations. 
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The acceptor substrate specificity and kinetic properties of p3Gal-T5 are 
similar to a previously reported porcine tracheal p3Gal-transferase activity (Sheares, B.T. 
and Carlson, D.M. Characterization of UDP-galactose:2-acetamido-2-deoxy-D- glucose 3 
beta-galactosyltransferase firom pig trachea. J. Biol Chem. 258:9893-9898, 1983) and 
hxunan colonic p3Gal-transferase activity (Seko, A., Ohkura, T., Kitamura, H., Yonezawa, 
S., Sato, E. and Yamashita, K. Quantitative differences in GIcNAc:betal— >3 and 
GlcNAc:betal— >4 galactosyltransferase activities between human colonic adenocarcinomas 
and normal colonic mucosa. Cancer Res 56:3468-3473, 1996), Both the porcine and 
human p3Gal-transferase activities have apparent Kms for UDP-Gal of 200-220 nM using 
pGlcNAcpl-3Gal(GalNAc) acceptor substrates, and the secreted recombinant P3Gal-T5 
had an apparent Km of 169 |iM (Table V). 



Table V Kinetic properties of pSGal-TS 



15 



p3Gal-T5^ 



Substrate^ 



Km 



Vmax 



20 



25 



UDP-Gal 
p-D-GlcNAc-Bzl 

P-D-GlcNAc-(l-3)-a-D-GalNAc-/?-Nph 
P-D-GlcNAc-(l -3)-a-D-Gal-Me 



rxiM 
0.169 

20.4 

2.8 

1.8 



pmol/min 
1422.2 

873.4 

931.1 

972.9 



Enzyme partially purified as described elsewhere herein. 
The concentrations used were 25-400 |iM for UDP-Gal, 3.75-60 mM for P-D- 
GlcNAc-Bzl and 0.3125-5 mM for p-D-GlcNAc-(;-3)-a-D-GalNAc-p-Nph and P- 
D-GlcNAc(l -3)-a-D-Gal-Me. 



30 



These relatively high Kms for donor substrates are significantly different 
fi-om those reported for p3Gal-TI and -T2 (90 and 37 ^M, respectively) (Amado, M., 
Almeida, R., Cameiro, F., et al. A family of human P 3 -galactosyltransf erases: 
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Characterisation of four members of a UDP-galactose P-N-acetylglucosamine/p-N- 
acetyjgalactosamine pi, 3-Galactosyltransferase family. J.Biol. Chem. 273:12770-12778, 
1998). Interestingly, activity of the fiill length coding construct of p3Gal-T5 analyzed in 
g Triton CF-54 homogenates of infected insect cells showed a lower apparent Km of 33 
for the donor substrate (not shown). The purified P3GaI-transferase activity analyzed by 
Sheares, et al. (Shears, B.T. and Carlson, D.M. Characterization of UDP-gaIactose:2- 
acetamido-2-deoxy-D- glucose 3 beta-galactosyltransferase fi-om pig trachea. J. Biol. 
Chem. 258:9893-9898, 1983) is, however, likely to represent a truncated proteolytically 
cleaved form that is often found with affinity-purified glycosyltransferase preparations 
(Clausen, H., White, T., Takio, K., et al. Isolation to homogeneity and partial 

characterization of a histo-blood group A defined Fuc alpha 1 — 2Gal alpha 1 3-N- 

acetylgalactosaminyltransferase fi-om human lung tissue. J.Biol. Chem. 265:1139-1145, 
1990). Moreover. Hohnes (Hohnes, E.H. Characterization and membrane organization of 
^5 beta 1— -3- and beta 1—4- galactosyltransferases fi-om human colonic adenocarcinoma cell 
lines Cob 205 and SW403: basis for preferential synthesis of type 1 chain lacto-series 
carbohydrate structures. Arch Biochem Biophys 270:630-646, 1 989) reported that non- 
purified P3GaI-T activity fi-om Colo205 cells had an apparent Km for UDP-Gal of 48 nM 
using glycolipids as acceptor substrate. This preparation may contain both fiill and secreted 
20 forms of transferases. The recombinant fiill length form of p3Gal-T5 resembled the 
recombinant secreted form in all other aspects tested. The porcine p3Gal-transferase 
activity has an apparent Km for core 3 of 2.4 mM and p3GaI-T5 exhibited an apparent Km 
for core 3 of 2.8 mM. Hohnes (Holmes. E.H. Characterization and membrane organization 
of beta 1-— 3- and beta 1—4- galactosyltransferases from human colonic adenocarcinoma 
25 cell lines Cob 205 and SW403: basis for preferential synthesis of type 1 chain lacto-series 
carbohydrate structures. Arch Biochem Biophys 270:630-646, 1989) reported a Km for 
Lc3Cer of 13nM for p3Gal-T activity from Colo205 cells. The best substrate identified for 
p3Gal-T5 was P-D-GlcNAc(l-3)-D-P-Gal-l-Me [apparent Km of 1.8 mM (Table V)]. This 
is similar to the apparent Km of 2.9 mM for human colonic p3 Gal-T activity for p-D- 
30 GlcNAc(l-3)-D-p-Gal(1.4)-D-P-Glc (Seko, A., Ohkura, T., Kitamura, H., Yonezawa. S.. 
Sato, E. and Yamashita, K. Quantitative differences in GIcNAc:betal->3 and 
GlcNAc:betal->4 galactosyltransferase activities between human colonic adenocarcinomas 
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and nornial colonic mucosa. Cancer Res 56:3468-3473, 1996). P3Gal-T5 showed strict 
donor substrate specificity for UDP-Gal and did not utilize UDP-GalNAc or UDPGlcNAc 
with the acceptor substrates tested (data not shown). 

Expression of the full coding construct of P3Gal-T5 in Sf9 cells 60 hours 
postinfection resulted in virtually all p3Gal-transferase activity retained on cells (Table VI). 



Table VI Expression of full coding constructs of j83Gal-TJ and jS3Gal'T5 



15 



P3Gal-Tl^ 


p3Gal-T5 


Cells Media 


Cells Media 


nmol/min/ml 

p-D-GlcNAc- 7.2 0.2 


nmol/min/ml 
1.5 0.1 


a p-D-GlcNAcBzl was used at 20mM. 




b Enzyme sources were media or 1% 




Triton CF54 cell homogenates from 




PGal-Tl and -T5 transfected Sf9 




cells, harvested 60 hrs post-infection. 





This was also found for p3Gal-Tl (Table VI), and the same has been found 
2 0 for the other p3Gal-Ts as well as for a number of p4Gal-Ts and polypeptide GalNAc- 

transferases (not shown). In contrast, more than 50 % of the enzyme activity is found in the 
media after 60 hoiurs of transfection when truncated secreted constructs are used. 



2 5 5.5. 'H- and *^C-NMR SPECTROSCOPY OF PRODUCT FORMED 

GLYCOSYLATION OF CORE3-p-NPh WITH p3GAL-T5 
The product derived from reaction of p3Gal-T5 with 
GlcNAcpl-3GalNAcal-lpNp was characterized by NMR spectroscopy to confirm that the 
proper linkage was formed between the donor sugar and the acceptor substrate. 

3 0 Comparison of a 1-D 'H-NMR spectrum of the product (Fig. 4) with that of the substrate 

(not shown) clearly showed an additional H-1 resonance (4.467 ppm) from a sugar residue 
linked in the P-configuration C V; ^ = 7-9 Hz). This was accompanied by a downfield shift 
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of the p-GIcNAc H-1 resonance to 4.7 19 ppm (A6 0.065), as expected upon glycosylation 
of that residue. However, anomeric chemical shift criteria alone are insufficient for 
determining the identity and linkage position of the newly added residue. Since we were 
^ unable to find NMR data for the para-nitrophenyl glycosides of either the Core 3 substrate 
or the expected GalpSCore 3 product in the literature or in glycoconjugate NMR databases, 
and since the substantial anisotropic effects of the paranitrophenyl group obviate direct 
comparison of chemical shift data with those of the benzylglycosides (Pollex-Kruger, A., 
Meyer, B., Stuike-Pill, R., Sinnwell, V., Matta, K.L. and Brockhaiisen, I. Preferred 
conformations and dynamics of five core structures of mucin type O-glycans determined by 
NMR spectroscopy and force field calculations. Glycoconjugate J 10:365-380, 1993)). 
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20 



25 



30 
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Table VII ^H, '*^C chemical shifts (ppm) and ^H-H coupling constants (Hz) for CoreS-p- 
Nph substrate and biosynthetic Gal^-CoreS-p-Nph product in D2O at 25 X^, 



10 





Core 3 




Galpl-3-Core3 






GlcNAcB 

3 




Gaip3 


GlcNAcp3 


GalNAca 


H-l" 




5.785 


4.467 


4.719 


5.787 


H-2 


3,734 


4.502 


3.525 


3.866 


4.505 


H-3 


CO A 

3.584 


4.237 


3.651 


3.854 


4.253 


H-4 


3.456 


4.291 


3.919 


3.589 


4.303 


H-5 


3.453 


4.002 


3.721 


3.498 


4.002 


H-6R 


3.780 


3.732 


3.764 


3.800 


3.731 


H-6S 


3.918 


3.679 


3.764 


3.918 


3.681 


H-8 


2.033 


2.036 


N.A.'' 


2.026 


2.037 


(Me) 














8.2 


3.6 


8.0 


8.0 


3.7 


•^2.3 


10.2 


11.3 


10.3 


N.F.O.' 


10.9 




8.2 


3.1. 


3.6 


8.0 


3.0 


^5 


9.8 


<1.5 


<1.5 


10.1 


<1.5 




5.1 


7.7 


N.D.'* 


5.1 


8.0 


•^5,68 


2.1 


4.6 


N.D. 


2.2 


4.4 




-12.3 


110 
-11.8 


N.F.O. 


-12.4 


-11.7 


C-1 


102.33 


95.51 


103.24 


101.99 


95.48 


C-2 


55.38 


47.80 


70.41 


54.45 


47.76 


C-3 


73.17 


75.88 


72.31 


81.86 


76.01 


C-4 


69.53 


68.36 


68.29 


68.25 


68.27 


C-5 


75.48 


71.63 


75.10 


74.94 


71.61 


C-6 


60.25 


60.69 


60.75 


60.21 


60.63 


C-1 


174.28 


173.61 


N.A. 


N.D. 


N.D. 


(C=0) 












C-8 


21.99 


21.76 


N.A. 


21.90 


21.79 


(Me) 













chemical shifts are referenced to internal acetone (2.225 and 30.00 ppm for *H and 
'^C, respectively). 
N.A., not applicable. 
N.F.O. non-first-order. 
N.D., not determined. 



30 



Analysis of coupling constant data confirmed that the additional residue was 
a P-Gal (Vj,^ <1 .5 Hz). The 1^3 linkage was confirmed by the following criteria: (i) the 
largest glycosylation-induced chemical shift change among the core3 protons was observed 
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for P-GlcNAc H-3 (A6 = 0.270); (ii) consistent with this, in a 'H-'H ROESY spectrum of 
the product (not shown), the strongest rotating frame Overhauser enhancement observed 
from P-Gal H- 1 was to P-GIcNAc H-3; (iii) no other inter-residue correlations were 
^ observed originating from P-Gal H-1, and no ambiguity is introduced into interpretation of 
the ROESY spectrum by the near degeneracy of P-GIcNAc H-2 and H-3 in the product, 
since there is no potential glycosylation site at C-2; (iv) comparison of '^C spectral data for 
the substrate and product showed only one glycosylation-induced significant dovmfield 
shift, for P-GlcNAc C-3 (A6 8.69). The magnitude of the shift change is essentially 
diagnostic for glycosylation at that site. 

The product foimed with Gb^ was characterized by 1-D 'H-NMR 
spectroscopy (not shown); although more than one component was detected, five anomeric 
resonances were clearly observed for the major component, with chemical shifts and V, , 
coupling constants virtually identical to those obtained previously for Gaip I ->3Gb4 
^5 (Kamiagi. R., Levexy, S.B.. Ishigami, F.. et al. New globosides glycosphingolipidi in 
human teratocarcinoma reactive with the monoclonal antibody directed to a 
developmentally regulated antigen, stage-specific embryonic antigen 3. J. Biol Chem 
258:8934-8942, 1983). These were 4.810 ppm (V,, = 3.6 Hz), 4.620 ppm ^J., = 8.7 Hz) 
4.267 ppm (V,^ = 7.4 Hz), 4.198 ppm (V,, = 7.9 Hz), and 4.173 ppm (V,, = 7.9 Hz), 
2 0 corresponding to H-1 of Gala4. GalNAcps, Gaip4, Gaip3, and Glcpi. respectively, of the 
Gaipi-^3Gb, sequence. Anomeric resonances from some unreacted Gb^ were also detected 
in the product. The identity of a third, minor component, separable by preparative HPTLC, 
is currently imder investigation. 

25 5.6. NORTTTFR N ANAT.YSIS OF p l^HAr-T^ 

Northern analysis of multiple tissue northern (MTN) blots from Clontech 
failed to produce signals in several attempts. Sequence analysis suggested that the 
transcript could exceed 10 kilobase (kb), based on the finding that the first upstream 
polyadenylation consensus signal. Therefore, an absence of signal on the commercial blots 

3 0 could be explained by poor transfer of large mRNAs. A blot was prepared with total RNA 
from human carcinoma cell lines, and care was taken to insure efficient transfer of long 
mRNA species. This yielded hybridizing bands at 12 kb or bigger for three cell lines: 
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AsPC-1, HPAF, Suit2, and S2-013. Interestingly, apart from the single EST identified for 
the coding region of p3Gal-T5, no ESTs derived from any part of the 3*UTR of the 
approximate 10 kb region have been included in the EST databases. It is unclear at this 
time why this protein of average mass is encoded by a 12 kb mRNA transcript. 

5 

5J. DNA. VECTORS. AND HOST CELLS FOR B3GAL-T5 

In practicing the present invention, many conventional techniques in 
molecular biology, microbiology, recombinant DNA, and immunology, are used. Such 
^ techniques are well known and are explained fully in, for example, Sambrook et al., 1989, 
Molecular Cloning. A Laboratory Manual Second Edition, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York; DNA Cloning: A Practical Approach Volumes I 
and II, 1985 (D.N. Glover ed.); Oligonucleotide Synthesis, 1984, (M.L. Gait ed.); Nucleic 
Acid Hybridization, 1985, (Hames and Higgins); Transcription and Translation, 1984 
(Hames and Higgins eds.); y^mwa/ Ce// Cw/rwre, 1986 (R.I. VT^%\moy Immobilized 

5 

Cells and Enzymes , 1986 (IRL Press); Perbal, 1984, A Practical Guide to Molecular 
Cloning: the series. Methods in Enzymology (Academic Press, Inc.); Gene Transfer Vectors 
for Mammalian Cells, 1987 (J, H. Miller and M. P. Carlos eds., Cold Spring Harbor 
Laboratory); Methods in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, 

Q eds., respectively); Immunochemical Methods in Cell and Molecular Biology, 1 987 (Mayer 
and Waler, eds; Academic Press, London); Scopes, 1987, Protein Purification: Principles 
and Practice, Second Edition (Springer-Verlag, N.Y.) and Handbook of Experimental 
Immunology, 1986, Volumes I-IV (Weir and Blackwell eds.); Ausubel et al., eds., in the 
Current Protocols in Molecular Biology series of laboratory technique manuals, © 1987- 

5 1997 Current Protocols, © 1994-1997 John Wiley and Sons, Inc.); and Dyson, N.J., 1991, 
Immobilization of nucleic acids and hybridization analysis. In: Essential Molecular 
Biology: A Practical Approach, Vol. 2, T.A. Brown, ed., pp. 111-1 56, IRL Press at Oxford 
University Press, Oxford, U.K.; each of which is incorporated by reference herein in its 
entirety). 

Q The invention encompasses isolated nucleic acid fragments comprising all or 

part of the nucleic acid sequence disclosed herein as set forth in Figure 1. The fragments 
are at least about 8 nucleotides in length, preferably at least about 12 nucleotides in length, 
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and preferably at least about 15-20 nucleotides in length. Further, such fragments may be at 
least about 50, 100, 200, 500, 1000, 2000, 5000, or 10,000 nucleotides in length. The 
invention further encompasses isolated nucleic acids comprising sequences that are 
^ hybridizable under stringency conditions of 2X SSC, 55 °C, to the sequence set fourth in 
Figure 1; preferably, the nucleic acids are hybridizable at 2X SSC, 65 °C; and most 
preferably, are hybridizable at 0.5X SSC, 65 °C. 

The nucleic acids may be isolated directly from cells. Alternatively, the 
polymerase chain reaction (PCR) method can be used to produce the nucleic acids of the 
invention, using either chemically synthesized strands or genomic material as templates . 
Primers used for PCR can be synthesized using the sequence information provided herein 
and can further be designed to introduce appropriate new restriction sites, if desirable, to 
facilitate incorporation into a given vector for recombinant expression. 

The nucleic acids of the present invention may be flanked by natural human 
regulatory sequences, or may be associated with heterologous sequences, including 
promoters, enhancers, response elements, signal sequences, polyadenylation sequences, 
introns, 5'- and 3'- noncoding regions, and the like. The nucleic acids may also be 
modified by many means known in the art. Non-limiting examples of such modifications 
include methylation, "caps", substitution of one or more of the naturally occurring 
2Q nucleotides with an analog, intemucleotide modifications such as, for example, those with 
uncharged linkages (e.g., methyl phosphonates, phosphotri esters, phosphoroamidates, 
carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, 
etc.). Nucleic acids may contain one or more additional covalently linked moieties, such as, 
for example, proteins (e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, 
25 etc.), intercalators (e.g., acridine. psoralen, etc.), chelators (e.g., metals, radioactive metals, 
iron, oxidative metals, etc.), and alkylators. The nucleic acid may be derivatized by 
formation of a methyl or ethyl phosphotriester or an alkyl phosphoranlidate linkage. 
Furthermore, the nucleic acid sequences of the present invention may also be modified with 
a label capable of providing a detectable signal, either directly or indirectly. Exemplary 
30 labels include radioisotopes, fluorescent molecules, biotin, and the hke. 

According to the present invention, useful probes comprise a probe sequence 
at least eight nucleotides in length that consists of all or part of the sequence from among 
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the sequences as set forth in Figure 1 or sequence-conservative or function-conservative 
variants thereof, or a complement thereof, and that has been labelled as described above . 

The invention also provides nucleic acid vectors comprising the disclosed 
^ sequence or derivatives or fragments thereof. A large number of vectors, including plasmid 
and fungal vectors, have been described for replication and/or expression in a variety of 
eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple 
cloning or protein expression. 

Recombinant cloning vectors will often include one or more replication 
systems for cloning or expression, one or more markers for selection in the host, e.g. 
antibiotic resistance, and one or more expression cassettes . The inserted coding sequences 
may be synthesized by standard methods, isolated from natiu-al sources, or prepared as 
hybrids, etc. Ligation of the coding sequences to transcriptional regulatory elements and/or 
to other amino acid coding sequences may be achieved by known methods. Suitable host 
cells may be transformed/ transfected/infected as appropriate by any suitable method 
including electroporation, CaClj mediated DNA uptake, fungal infection, microinjection, 
microprojectile, or other established methods. 

Appropriate host cells included bacteria, archebacteria, fungi, especially 
yeast, and plant and animal cells, especially mammalian cells. Of particular interest are 

2Q Saccharomyces cerevisiae, Schizosaccharomyces pombi, SF9 cells, CI 29 cells, 293 cells, 
Neurospora, and CHO cells, COS cells, HeLa cells, and immortalized mammalian myeloid 
and lymphoid cell lines. Preferred replication systems include Ml 3, ColEl, SV40, 
baculovirus, lambda, adenovirus, and the like. A large number of transcription initiation 
and termination regulatory regions have been isolated and shown to be effective in the 

25 transcription and translation of heterologous proteins in the various hosts . Examples of 
these regions, methods of isolation, manner of manipulation, etc. are known in the art. 
Under appropriate expression conditions, host cells can be used as a source of 
recombinantly produced p3Gal-T5 derived peptides and polypeptides. 

Advantageously, vectors may also include a transcription regulatory element 

3 Q (i.e., a promoter) operably Unked to the p3Gal-T5-coding portion. The promoter may 

optionally contain operator portions and/or ribosome binding sites. Non-limiting examples 
of bacterial promoters compatible with E. coli include: p-lactamase (penicillinase) 
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promoter; lactose promoter; tryptophan (tip) promoter; arabinose BAD operon promoter; 
lambda-derived P, promoter and N gene ribosome binding site; and the hybrid tac promoter 
derived from sequences of the tip and lac UV5 promoters. Non-limiting examples of yeast 
^ promoters include 3-phosphogIycerate kinase promoter, glyceraldehyde-3 phosphate 
dehydrogenase (GAPDH) promoter, galactokinase (GALI) promoter, galactoepimerase 
promoter, and alcohol dehydrogenase (ADH) promoter. Suitable promoters for mammalian 
cells include without limitation viral promoters such as that from Simian Virus 40 (SV40), 
Rous sarcoma virus (RSV), adenovirus (ADV), and bovine papilloma virus (BPV). 
Mammalian cells may also require terminator sequences and poly A addition sequences and 
enhancer sequences which increase expression may also be included; sequences which 
cause amplification of the gene may also be desirable. Furthermore, sequences that 
facihtate secretion of the recombinant product from cells, including, but not limited to, 
bacteria, yeast, and animal cells, such as secretory signal sequences and/or prohormone pro 
^5 region sequences, may also be included. These sequences are known in the art. 

Nucleic acids encoding wild-type or variant polypeptides may also be 
introduced into cells by recombination events. For example, such a sequence can be 
introduced into a cell, and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other recombination- 
20 based methods such as nonhomologous recombinations or deletion of endogenous genes by 
homologous recombination may also be used. 

The nucleic acids of the present invention find use, for example, as probes 
for the detection of or related organisms and as templates for the recombinant production of 
peptides or polypeptides. These and other embodiments of the present invention are 
2 5 described in more detail below. 

5.8. POLYPEPTIDES OF p ^HAT -T^ 

The present invention encompasses isolated peptides (generally defined as a 
polypeptide having less than 50 amino acid residues) and polypeptides encoded by the 
30 disclosed nucleic acid sequence. Peptides are preferably at least five residues in length. 
Peptides or polypeptides may be, for example. 6, 10, 15, 30, 50, 100, 200, or 300 residues in 
length. 
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Nucleic acids comprising protein-coding sequences can be used to direct the 
recombinant expression of polypeptides in intact cells or in cell-free translation systems. 
The known genetic code, tailored if desired for more efficient expression in a given host 
organism, can be used to synthesize oligonucleotides encoding the desired amino acid 
sequences. The phosphoramidite solid support method of Matteucci a/., 1981, J. Am. 
Chem. Soc. 103:3185, the method of Yoo a/., 1989, J. Bid, Chem. 764:17078, or other 
well known methods can be used for such synthesis. The resulting oligonucleotides can be 
inserted into an appropriate vector and expressed in a compatible host organism. 

The polypeptides of the present invention, including function-conservative 
variants of the disclosed sequence, may be isolated from native or from heterologous 
organisms or cells (including, but not limited to, bacteria, fungi, insect, plant, and 
mammalian cells) into which a protein-coding sequence has been introduced and expressed. 
Furthermore, the polypeptides may be part of recombinant fusion proteins. 

Methods for polypeptide purification are well-known in the art, including, 
without limitation, preparative disc-gel elctrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. For some purposes, it is preferable to produce the polypeptide 
in a recombinant system in which the protein contains an additional sequence tag that 
2 Q facilitates purification, such as, but not limited to, a polyhistidine sequence. The 

polypeptide can then be purified from a crude lysate of the host cell by chromatography on 
an appropriate solid-phase matrix. Alternatively, antibodies produced against a protein or 
against peptides derived therefrom can be used as purification reagents. Other purification 
methods are possible. 

2 5 The present invention also encompasses derivatives and homologues of 

polypeptides. For some purposes, nucleic acid sequences encoding the peptides may be 
altered by substitutions, additions, or deletions that provide for functionally equivalent 
molecules, i.e., function-conservative variants. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of similar properties, 

3 Q such as, for example, positively charged amino acids (arginine, lysine, and histidine); 

negatively charged amino acids (aspartate and glutamate); polar neutral amino acids; and 
non-polar amino acids. 
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The isolated polypeptides may be modified by, for example, 
phosphorylation, sulfation, acylation, or other protein modifications. They may also be 
modified with a label capable of providing a detectable signal, either directly or indirectly, 
^ including, but not limited to, radioisotopes and fluorescent compounds. 

5.9. ANTIBOniES OF p iOAT-T^ 

The present invention encompasses various antibodies that specifically 
recognize immunogenic components derived fi-om p3Gai-T5. Such antibodies can be used, 
^ Q for example, as reagents for detection and purification of p3Gal-T5. 

P3Gal-T5 specific antibodies according to the present invention include 
polyclonal, monoclonal and humanized antibodies, as well as fi-agments and derivatives 
thereof The antibodies of the invention may be elicited in an animal host by immunization 
with P3 Gal-T5 components or may be formed by in vitro immunization of immune cells. 
^5 The immunogenic components used to elicit the antibodies may be isolated from human 
cells or produced in recombinant systems. The antibodies may also be produced in 
recombinant systems programmed with appropriate antibody-encoding DNA. 
Alternatively, antibodies maybe constructed by biochemical reconstitution of purified 
heavy and hght chains. Antibodies of the invention include hybrid antibodies (i.e., 
2 0 containing two sets of heavy chain/light chain combinations, each of which recognizes a 
different antigen), chimeric antibodies (i.e., in which either the heavy chains, light chains, 
or both, are fiision proteins), and univalent antibodies (i.e., comprised of a heavy chain/light 
chain complex bound to the constant region of a second heavy chain). Also included are 
Fab fragments, including Fab' and F(ab)2 fragments of antibodies, single chain antibodies, 
25 anti-idiotypic (anti-Id) antibodies, and epitope-binding antibody fragments. Methods for 
the production of all of the above types of antibodies and derivatives are well-known in the 
art. For example, techniques for producing and processing polyclonal antisera are 
disclosed in Mayer and Walker. 192,1 , Immunochemical Methods in Cell and Molecular 
Biology. (Academic Press, London). Further description of the polyclonal, monoclonal, 
30 chimeric and humanized antibodies of the invention is set forth below. 

Polyclonal antibodies of the invention are heterogeneous populations of 
antibody molecules derived from the sera of immunized animals. Various procedures well 

- 30 - 



BNSDCCID: <WO OOZSSSaAl 



wo 00/29558 



PCT/US99/26807 



known in the art may be used for the production of polyclonal antibodies to p3Gal-T5 and 
fragments thereof. For the production of polyclonal antibodies, various host animals can be 
immunized by injection with P3Gal-T5 or a fragment or derivative thereof, including but 
^ not limited to rabbits, mice, rats, etc. Various adjuvants may be used to increase the 
immunological response, depending on the host species, and including but not limited to 
Freund*s (complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille 
Calmette-Guerin) and corynebacterium parvum. Such adjuvants are also well known in the 
art. 

Monoclonal antibodies of the invention are homogeneous populations of 
antibodies to a particular antigen. A monoclonal antibody (mAb) to P3Gal-T5 or a 
fragment or derivative thereof can be prepared by using any technique known in the art 
which provides for the production of antibody molecules by continuous cell lines in culture. 
These include but are not limited to the hybridoma technique originally described by Kohler 
and Milstein (1975, Nature 256, 495-497), and the more recent human B cell hybridoma 
technique (Kozbor et al., 1983, Immunology Today 4, 72), and the EBV-hybridoma 
technique (Cole et al., 1985, Monoclonal Antibodies and Cancer Therapv , Alan R. Liss, 

2 Q Inc., pp. 77-96). Such antibodies may be of any immunoglobulin class including IgG, IgM, 

IgE, IgA, IgD and any subclass thereof. The hybridoma producing the mAbs of use in this 
invention may be cultivated in vitro or in vivo. 

Monoclonal antibodies of the invention include but are not limited to human 
monoclonal antibodies. Human monoclonal antibodies may be made by any of numerous 
25 techniques known in the art {e.g., Teng et al., 1983, Proc. Nat'1 Acad. Sci. U.S.A. 80, 7308- 
7312; Kozbor et al., 1983. Immunology Todav 4. 72-79; Olsson et al., 1982, Meth. 
Enzvmol. 92. 3-16). 

This invention provides chimeric antibodies specific for P3Gal-T5 or a 
fragment or derivative thereof. A chimeric antibody is a molecule in which different 

3 Q portions are derived from different animal species, such as those having a variable region 

derived from a murine mAb and a human immunoglobulin constant region. Various 
techniques are available for the production of such chimeric antibodies {see, e.g., Morrison 

- 31 - 



BNSCXXIID: <WO 0029558A1J_> 



wo 00/29558 



PCTAJS99/26807 



et aL, 1984, Proc. Nat'l Acad. Sci. TJ.S A 81, 6851-6855; Neuberger et al., 1984, Nature, 
312, 604-608; Takeda et al., 1985, Nature . 314, 452-454) by splicing the genes from a 
mouse antibody molecule of appropriate antigen specificity together with genes from a 
^ hxmian antibody molecule of appropriate biological activity. 

This invention provides humanized antibodies specific for P3Gal-T5 or a 
fragment or derivative thereof. Briefly, humanized antibodies are antibody molecules from 
non-human species having one or more complementarily determining regions (CDRs) from 
the non-human species and a framework region from a human immunoglobulin molecule. 
Various techniques have been developed for the production of humanized antibodies (see, 
e.g.. Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in its 
entirety). An immunoglobulin hght or heavy chain variable region consists of a 
"firework" region interrupted by three hypervariable regions, referred to as 
complementarily determining regions (CDRs). The extent of the framework region and 
CDRs have been precisely defined (see, Kabat et al., 1983, Sequences of proteins of 
immunological interest, U.S. Department of Health and Human Services). 

Further, techniques described for the production of single chain antibodies 
(U.S. Patent No. 4,946,778; Bird, 1988, Science 242, 423-426; Huston et al., 1988, Proc. 
Nat'l. Acad. Sci. TT.S A 85, 5879-5883; and Ward et al., 1989, Nature 334, 544-546) can be 
2 0 adapted to produce single chain antibodies specific for p3Gal-T5 or a fragment or derivative 
thereof Single chain antibodies are formed by linking the heavy and light chain fragments 
of the Fv region together via an amino acid bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes of P3Gal-T5 or a 
fragment or derivative thereof may be generated by known techniques. For example, such 
25 fragments include but are not Umited to: the F(ab')2 fragments which can be produced by 
pepsin digestion of the antibody molecule and the Fab fragments which can be generated by 
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al., 1989, Science . 246, 1275-1281) to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. 

30 
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Further, general methods of antibody production and use are suitable for the 
antibodies of the invention. For example see Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual , Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 
^ which is incorporated herein by reference in its entirety. 

The antibodies of the invention can be purified by standard methods, 
including but not limited to preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, 
and countercurrent distribution. Purification methods for antibodies are disclosed, e.g., in 
The Art of Antibody Purification, 19S9,Amicon Division, W Grace & Co. General 
protein purification methods are described in Protein Purification: Principles and Practice, 
R.K. Scopes, Ed., 1987, Springer- Verlag, New York, New York. 

Anti-p3Gal-T5 antibodies, whether unlabeled or labeled by standard 
methods, can be used as the basis for immxmoassays. The particular label used will depend 
upon the type of immunoassay used. Examples of labels that can be used include, but are 
not limited to, radiolabels such as ^^P, ^H and '"^C; fluorescent labels such as fluorescein 
and its derivatives, rhodamine and its derivatives, dansyl and umbelliferone; 
chemiluminescers such as luciferia and 2,3-dihydrophthal-azinediones; and enzymes such 
as horseradish peroxidase, alkaline phosphatase, lysozyme and glucose-6-phosphate 
2Q dehydrogenase. 

The antibodies can be tagged with such labels by known methods. For 
example, coupling agents such as aldehydes, carbodiimides, dimaleimide, imidates, 
succinimides, bisdiazotized benzadine and the like may be used to tag the antibodies with 
fluorescent, chemiluminescent or enzyme labels. The general methods involved are well 
25 known in the art and are described in, e.g., Chan (Ed.), 1987, Immunoasscry: A Practical 
Guide, Academic Press, Inc., Orlando, FL. 

The invention described and claimed herein can be fiirther appreciated by 
one skilled in the art through reference to the examples which follow. These examples are 
provided merely to illustrate several aspects of the invention and shall not be constmed to 
3 Q limit the invention in any way. 



- 33 - 



BNSDOCID: <WO 0029558A1J_> 



wo 00/29558 



PCTAJS99/26807 



6. EXAMPLES 

Using BLAST analysis of an EST database, we identified a total often 
candidate human homologous members of the p3Gal-T gene family including the four 
^ members previously reported (Amado, M., Almeida, R., Cameiro, F., et al. A family of 
human P3-galactosyltransferases: characterisation of four members of a UDP-galactose P- 
N-acetylglucosamine/p-N-acetylgalactosamine pi,3-Galactosyltransferase family. J. Biol. 
Chem. 273:12770-12778, 1998). Analysis of sequence similarity of the first four members 
revealed features indicative of functions of encoded enzymes, including conservation of 
cysteine residues, spacing of conserved motifs, and hydropathy profiles (Amado, M., 
Ahneida, R., Cameiro, F., et al. A family of human P3-galactosyltransferases: 
characterisation of four members of a UDP-galactose p-N-acetylglucosamine/p-N- 
acetylgalactosamine pi,3-Galactosyltransferase family. J.Biol. Chem. 273:12770-12778, 
1998). P3Gal-T4 differed significantly from p3Gal-Tl, -T2, and -T3 in this respect, and the 
function of this enzyme was different in that the acceptor saccharide was pGalNAc in 
ganglioseries glycohpids (Miyaki, H., Fukumoto, S., Okada, M., Hasegawa, T. and 
Furukawa, K. Expression cloning of rat cDNA encoding UDP-galactose G(D2) pi, 3 
galactosyltransferase that determines the expression of G(D1 b)/G(M l)G(Al). J. Biol. 
Chem. 272:24794-24799, 1997; Amado, M., Almeida, R., Cameiro, F., et al. A family of 
2 Q human p3-galactosyltransferases: characterisation of four members of a UDP-galactose p- 
N-acetylglucosamine/p-N-acetylgalactosamine pi,3-Galactosyltransferase family. J. Biol. 
Chem. 273:12770-12778, 1998). 

A sequence derived from an EST clone (GenBank accession number 
AJ003597) was predicted to represent a new gene encoding a p3Gal-T forming the Galpl- 
2 5 3GlcNAc linkages. This report describes the cloning and expression of this gene, 
designated p3Gal-T5, and demonstrates that the encoded enzyme has better kinetic 
properties than those of the previously cloned p3Gal-Ts. p3Gal-T5 is a candidate for the 
P3Gal-T activity foimd in epithelia. 



30 6.1. IDENTIFICATION AND CLONING OF p3Gal-Tf> 

The BLASTn and tBLASTn were used with the coding sequence of human 
p3Gal-T2 to search the dbEST database at The National Centre for Biotechnology 
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Information (NCBI, USA) as previously described (Almeida, R., Amado, M., David, L., at 
al. A Family of Human P4-Galactosyltransferases: Cloning and expression of two novel 
UDP-GalactOSe p-N-Acetylglucosaflhine pl,4-GalactosyltransferaSes, p4Gal-T2 and 
p4Gal-T3. J.Biol.Chem. 272:31979-31992,1997). One EST (GenBank accession number 

5 

AJ003597) was identified as representing a putative novel p3Gal-T gene. Since the coding 
regions of all other cloned members of the human P3Gal-T gene family were found to be 
encoded in a single exon, we used the EST sequence information to design primers for PGR 
screening of a PI genomic library. A human foreskin PI library (DuPont Merck 
Pharmaceutical Company Human Foreskin Fibroblast PI Library) was screened using the 
primer pairs EBER 1301 (5'-CTTCCTTAAGCTCCCAGATAC 3') (SEQ ID NO:l) and 
EBER 1302 (5*- GTTTCCGCTGCACTGCTGGTG 3') (SEQ ID NO:2). One PI clone for 
P3Gal-T5 (DMPC-HFF#1-1 195h3) as well as DNA from PI phages were obtained from 
Genome Systems Inc. Sequencing of this PI DNA revealed an open reading frame of 933 
bp encoding a putative protein with a type n domain structure (Fig. 1). The entire coding 
sequence of p3Gal-T5 was fully sequenced using automated sequencing (ABI377, Perkin 
Elmer) with dye terminator chemistry. The EST clone AJ003597 was derived from a 
chromosome 21 library, and subsequently a 165 kilobase pair PAC sequence containing the 
entire sequence of p3Gal-T5 was linked to 21 q22.3 (GenBank accession number- 
AF064860). The EST sequence did not appear to be derived from correct oligo-dT priming, 
and analysis of the genomic PAC sequence showed that the first downstream consensus 
polyadenylation signal (AATAAA) was 9568 bp from the first initiation codon. The 
putative 3' UTR sequence contained repeats and potential short coding regions, but none of 
the coding regions showed similarity to known genes. No ESTs from the 3 'UTR have been 

25 deposited in the GenBank database. A second consensus polyadenylation signal is found 
2991 bp downstream of the first, and a few 3' ESTs have been identified from this site and 
mapped (STS-N41029), but no sequence encoding protein with similarity to known genes 
have been assigned from this region. 

The EST sequence (AJ003597) is 338 nucleotides long. Nucleotides 1-312 

3 Q of AJ003597 encode the complement of nucleotides 38-349 of the coding region of P3Ga]- 
T5 (Figure 1) (nucleotides 1 16-427 of SEQ ID NO:8), 
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6.2. EXPRESSTON OF p :^nal-Tf; 
What follows are examples of expression of p3Gal-T5 in insect cells, and as 
a full-length or partial-length (soluble) gene product in CHO cells. 

5 

EXPRESST ON OF pSfial-TS IN INSECT CFJ J S 
An expression construct (pAcGP67-p3Gal-T5-sol) designed to exclude the 
hydrophobic transmembrane segment and to encode amino acid residues 25-310, was 
prepared by PGR using PI genomic DNA, and the primer pair EBER1300 sol (5'- 
ATGTACAGTCTAAATCCTTTC) (SEQ ID NO:3) and EBER1310 (5 
TCAGACAGGCGGACAATCTTC) (SEQ ID NO:4) (Fig. 1), which included BamHI 
restriction sites. PGR product was cloned into the BamHI site of pAcGP67B (Pharmingen). 
An expression construct (pVL-p3Gal-T5-full) designed to encode the full coding sequence 
(from first ATG, Fig. 1) was prepared by PGR with PI genomic DNA using the primer pair 
^5 EBER1309(5'-ATGGCTTGGGGAAGATGAG)(SEQIDNO:5)andEBER1310. This 
PGR product was cloned into the BamHI site of pVLl 193 (Pharmingen). Both soluble and 
full length constructs were fully sequenced to confirm fidelity. Plasmids pAcGP67- 
p3GalT5-sol and pVL-p3Gal-T5-full were co-transfected with Baculo-Gold™ DNA 
(Pharmingen) as described previously (Bennett, E.P., Hassan, H. and Clausen, H. cDNA 
2 Q cloning and expression of a novel human UDP-N-acetyl-alpha-D-galactosamine. 

Polypeptide N-acetyl-galactosaminyl-transferase, GaINAc-T3. J.Biol. Chem. 271:17006- 
17012, 1996). Recombinant Baculo-virus were obtamed after two successive amplifications 
m Sf9 cells grown in serum-containing medium, and titers of virus were estimated by 
titration in 24-well plates with monitoring of enzyme activities. Controls included 
25 pAcGP67-P3Gal-Tl (Amado, M., Almeida, R., Gameiro, F., et al. A family of human p3- 
galactosyltransferases: characterization of four members of a UDP-galactose p-N- 
acetylglucosamine/p-N-acetylgalactosamine pl,3-Galactosyltransferase family. J. Biol. 
Chem. 273:12770-12778, 1998), pAcGP67-p3Gal-T2 (Id.), pAcGP67-p4Gal-T2 (Almeida, 
R., Amado, M., David, L., et al. A Family of Human P4-Galactosyltransferases: Cloning 
30 and expression of two novel UDP-Galactose P-N-Acetylglucosamine pi,4- 

Galactosyltransferases, p4Gal-T2 and p4Gal-T3. J.Biol.Chem. 272:31979-31992, 1997), 
pAcGP67-P4Gal-T3 (Id.), and pAcGP67-GalNAc-T3-sol (Bennett, E.P., Hassan, H. and 
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Clausen, H. cDNA cloning and expression of a novel human UDP-N-acetyl-alpha-D- 
galactosamine. Polypeptide N-acetyl-galactosaminyl-transferase, GaINAc-T3. J. Biol. 
Chem, 271 : 17006-1 7012, 1996). For large scale expression amplified virus was used to 
infect High Five'^^ cells grown in serum-free media (Invitrogen) in upright roller bottles 

5 

shaking at 140 rpm and 27 °C. 

The kinetic properties were determined with partially purified, secreted 
forms of the enzymes. Semipurification of enzymes from serum-free medium of infected 
High-Five™ cells was performed by sequential Amberlite, DEAE-Sephacel and 5- 
Sepharose chromatography as described previously (Wandall, H.H., Hassan, H., 
Mirgorodskaya, E., et al. Substrate specificities of three members of the human UDP-N- 
acetyl-alpha-D-galactosamine:Polypeptide Nacetylgalactosaminyltransferase family, 
GalNAc-Tl, -T2, and -T3. J. BioL Chem, 272:23503-23514, 1997). Comparisons of 
enzymes were performed relatively to the activity obtained with pClcNAc-Bzl (Tables 11 
and ni). Full length enzymes were assayed with 1% Triton CF54 homogenates of washed 
cells. Enzyme assays were performed in 50 ^1 total reaction mixtures containing 25 mM 
Cacodylate (pH 7.5), 10 mM MnCl2, 0.25% Triton X-100, 100 ^iM UDP-[''C]-Gal (2,600 
cpm/nmol) (Amersham), and varying concentrations of acceptor substrates (Sigma) (see 
Table I for structures). Reaction products were quantified by Dowex-1 chromatography. 

2 Q Assays with glycoproteins were performed with the standard reaction mixture modified to 
contain 150 jiM UDP-Gal, 54 mM NaCl, and 0.5 mg ovalbumin, asialo-agalacto-fetuin, 
orosomucoid, or bovine submaxillary mucin acceptor substrates obtained as previously 
described (Schwientek, T., Almeida, R., Levery, S.B., Holmes, E., Bennett, E.P. and 
Clausen, H. Cloning of a novel member of the UDP-galactose: p-N-acetylglucosamine pi 

25 galactosyltransferase family, P4Gal-T4, involved in glycosphingolipid biosynthesis. 
XBioIChem, 273:29295-29305, 1998). The transfer of Gal was evaluated after acid 
precipitation by filtration through Whatman GF/C glass fibre filters. Assays to determine 
Km of acceptor substrates and donor substrates were modified to include 200 |iM UDP- 
['"^Cl-Gal (2,600 cpm/nmol) or 30mM GIcNAcp-benzyl. Assays with glycolipid acceptors 

3Q were conducted as previously described (Holmes, E.H. Characterization and membrane 
organization of beta 1 — 3- and beta 1 — 4- galactosyltransferases from human colonic 
adenocarcinoma cell lines Cob 205 and SW403: basis for preferential synthesis of type 1 
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chain lacto-series carbohydrate structures. Arch Biochem Biophys 270:630-646, 1989) in 
reaction mixtures containing 2.5 ^mol HEPES buffer, pH 7.2, 1 ^mol MnClj, 100 ng 
taurodeoxycholate or Triton CF-54, 20 \ig acceptor glycolipid, 15 nmol UDP-f'^C]- 
^ galactose (13,000 cpm/nmol) and enzyme in a total volume of 100 ^il. Conditions for 
incubation and product isolation were as previously described (Id.). 

6.2.2. STABLE EXPRESSION OF FULL CODING SEQUENCE 
OF B3Gal-TS TN CHO CELLS 
Q A cDNA sequence encoding the full coding sequence of the p3Gal-T5 gene 

was derived by RT-PCR using primers EBER 1309 and EBER 1310 with BamUl restriction 
sites introduced. The PCR product was designed to yield a p3Gal-T5 protein with a 
hydrophobic transmembrane retention signal in order to have the enzyme expressed and 
positioned in the appropriate Golgi compartment of the transfected cell. The PCR product 
g was mserted into the Banim site of a mammalian expression vector pCDNA3 (Invitrogen), 
and the construct, pCDNA3- p3Gal-T5-mem, was transfected into CHO cells and stable 
transfectants were selected. Further details are provided below. 

The fuU-length Golgi-retained form of p3Gal-T5 was stably expressed in 
Chinese Hamster Ovary cells (CHO-Kl) obtained from ATCC. The full-length coding 
J construct, designed to contain amino acids 1-310, was prepared by PCR with PI genomic 
DNA using the primer pair EBER1309 and EBER1310 (Fig. 1), which included Banim 
restriction sites. Correct insertion of the PCR product cloned into the Banim site of the 
pcDNA3 vector Gnvitrogen) was confirmed by sequencing. The predicted coding region of 
the construct is shown in Figure 1 . CHO-Kl cells were transfected using 0.2 ^g DNA and 5 
; Mg lipofectamine (Invitrogen) in subconfluent 6 well plates according to the manufacturer's 
protocol. After 48 hours, the medium was changed and 400 ^g/ml G418 was added. At 72 
hours 10-20 % of the wells were trypsinized and the percentage of cells expressing pSGal- 
T5 was evaluated by immunocytology using an anti-p3Gal-T5 monoclonal antibody, UH9. 

6.2.3. STABLE EXPRESSION OF SOLUBLE FORM OF 
B3Gal-T5 fN CHO CELLS 
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cDNA pAcGP67- P3Gal-T5-sol containing the coding sequence of a soluble, 
secreted P3Gal-T5 enzyme was cloned into the BamHl site of a modified mammalian 
expression vector, pCDNA3 (Invitrogen). pcDNA3 was modified by insertion of an 
interferon signal peptide sequence into the Kpnl/BamHl site of ensuring secretion of the 

5 

expressed product when cloned into the vector. The pcDNA3-YlNF-p3Gal-T5-sol construct 
was transfected into CHO cells and stable transfectants were selected. Further details are 
provided below. 

The secretable form of p3Gal-T5 was stably expressed in Chinese Hamster 
Ovary cells (CHO-Kl) obtained from ATCC. A truncated construct, designed to contain 
amino acids 25-310, was prepared by PGR using PI genomic DNA and the primer pair 
EBER1300 sol (SEQ ID NO:3) and EBER1310 (SEQ ID NO:4) (Fig. 1), which included 
BamBl restriction sites. The PGR product was cloned into the BamHl site of a modified 
pcDNA3 vector (Invitrogen). The pcDNA3 vector was modified to include 19 amino acids 
of the gamma-interferon signal sequence by directional insertion of a synthetic sequence of 
91 bp coding for the interferon sequence with Kpnl and BamHl flanking sites. The 
modified pcDNA3 vector was constructed as follows. Four synthetic oligonucleotides were 
synthesized: INFFOR (5*-cggggtaccggaaacgatgaaatatacaag-3*) (SEQ ID NO: 14); INFREVA 
(5'-ggcggatccaggcagatcacagccaagagaacccaaaacg-3') (SEQ ID NO: 15); INFREVB (5 - 

2 0 gcggatcccaggcagatcacagccaagagaacccaaaacg-3') (SEQ ID NO: 16); and INFREVC (5'- 

gcggatccccaggcagatcacagccaagagaacccaaaacg3') (SEQ ID NO: 1 7). Oligonucleotide primer 
pairs INFFOR/INFREVA, INFFOR/INFREVB and INFFOR/INFREVC were used to PGR 
amplify an interferon coding DNA fragment from human genomic DNA under the 
following conditions: 95 ""C for 30 seconds, 60°C for 5 seconds, 72 °C for 15 seconds, using 
25 Ampli-Taq (Perkin-Elmer Getus) and a model 480 Thermocycler (Perkin-Elmer). The use 
of three 3' primers spaced one base apart yields three vectors with a BamHl site positioned 
for any of three reading frames with respect to the signal sequence. 

GHO-Kl cells (ATGC) were transfected using 0.2 ^lg DNA and 5 ^ig 
lipofectamine (Invitrogen) in subconfluent 6 well plates according to the manufacturer's 

3 Q protocol. After 48 hours, the medium was changed and 400 ^g/ml G41 8 was added. At 72 

hours 10-20 % of the wells were trypsinized and the percentage of cells expressing p3Gal- 
T5 was evaluated by immunocytology using an anti-p3Gal-T5 monoclonal antibody, UH9. 
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6.3. CHARACTERIZATION OF THE PRODUCT FORMED 
WITH CORK:^-p-Nph BY p Sfial-TiS 

Complete glycosylation of core3-;p-Nph was performed in a reaction mixture 
g consisting of 1 mU p3Gal-T5 (specific activity determined with pGlcNAc-Umb), 2 mg 
core3-p-Nph. 50 mM Tris (pH 7.0). 1 mM MnCl^, 0.01 % Triton X-100, and 4.6 ^imol 
UDP-Gal in a final volume of 500 ^]. The glycosylation was monitored by HPTLC and 
was complete after 3 hours incubation. The reaction product was isolated as previously 
described on octadecyl-silica cartridges ("Bakerbond;" J.T. Baker, PhiUipsburg, N.J.) 
(Ahneida, R., Amado, M., David, L., et al. A Family of Human P4-Galactosy]transferases: 
Cloning and expression of two novel UDP-Galactose P-N-Acetylglucosamine pl,4- 
Galactosyltransferases, p4Gal-T2 and p4Gal-T3. J.Biol.Chem. 272:31979-31992, 1997) 
using successive stepwise elutions with MeOH. The MeOH solution was evaporated to 
dryness and subjected to 'H-NMR analysis as described below. 

15 

6.3.1. 1-D 'H-NMR SPECTROSCOPY OF REACTION PRODUCTS 
WITH CORE3-p-Np h Alvn r:h, 
The purified product fi-om reaction with core3-p-NPh was deuterium 
exchanged by repeated sonication and lyophilization fi-om DjO. A saturated solution in 
20 was used for NMR analysis. 1-D 'H-NMR, 2-D "H-'H-TOCSY (Braunschweiler, L. 
and Ernst, R.R. Coherence transfer by isotropic mixing: Application to proton correlation 
spectroscopy. J. Magn. Reson. 53:521-528. 1983; Bax, A. and Davis, D.G. MLEV-1 7- 
based two-dimensional homonuclear magnetization transfer spectroscopy. J. Magn. Reson. 
65:355-360, 1985a) and -ROESY (Bothner-By. A.A., Stephens, R.L., Lee. J.M., Warren, 
25 CD. and Jeanloz, R.W. Structure determination of a tetrasaccharide: Transient nuclear 
Overhauser effects in the rotating frame. J.Am. Chem. Soc 106:81 1-813, 1984; Bax, A. and 
Davis. D.G. Practical aspects of two-dimensional transverse NOE spectroscopy. J. Magn. 
Reson. 63:207-213, 1985b) experiments were performed at 298 °C on a Varian Unity Inova 
600 MHz spectrometer (0.5 mL in 5 mm tube) using standard acquisition software available 
3 0 in the Varian VNMR software package. A "H-detected, '^C-decoupled, phase sensitive, 
gradient (Davis. A.L., Keeler, J.. Laue. E.D. and Moskau. D. Experiments for recording 
pure-absorption heteronuclear correlation spectra using pulsed field gradients. J. Magn. 
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Reson. 98:207-216, 1992) '^C-*H-HSQC (Bodenhausen, G. and Ruben, DJ. Natural 
abundance nitrogen- 15 NMR by enhanced heteronuclear spectroscopy. Chem, Phys, Lett, 
69:185-189, 1980) experiment was performed at 298°C on a Varian Unity Inova wide bore 
500 MHz spectrometer (2 mL in 8 nun tube). A 2 mg sample of core3-pNph was prepared 
in similar fashion and analyzed under identical conditions for comparison. Chemical shifts 
are referenced to intemal acetone (2.225 and 29.92 ppm for *H and ^^C, respectively). 

The purified glycosphingolipid products from reaction with Gb4 were 
deuterium exchanged by dissolving in CDCl3-CD,OD 2:1, evaporating thoroughly under 
dry nitrogen (repeating 2x), and then dissolved in 0.5 mL DMSO-£//2% D2O (Dabrowski, 
J., Hanfland, P. and Egge, H. Structural analysis of glycosphingolipids by high resolution 
IH nuclear magnetic resonance spectroscopy. Biochemistry 19:5652-5658, 1980) for NMR 
analysis. 1-D *H-NMR spectra were acquired at 600 MHz (temperature, 308 °K); 10,000 
FIDs were accumulated, with solvent suppression by presaturation pulse during the 
relaxation delay. Spectra were interpreted by comparison to spectra of relevant 
glycosphingolipid standards acquired previously under comparable conditions (Dabrowski, 
J., Hanfland, P. and Egge, H. Structural analysis of glycosphingolipids by high resolution 
IH nuclear magnetic resonance spectroscopy. Biochemistry 19:5652-5658, 1980; Kannagi, 
R., Levery, S.B., Ishigami, F., et al. New globosides glycosphingolipids in human 
teratocarcinoma reactive with the monoclonal antibody directed to a developmentally 
regulated antigen, stage-specific embryonic antigen 3. J. Biol Chem. 258:8934-8942, 
1983). 

6.4. RESTRICTED ORGAN EXPRESSION PATTERN OF p3Gal-T5 

Total RNA was isolated fi-om human adenocarcinoma cell lines AsPC-1, 
BxPC-3, Capan-1, Capan-2, Colo357, HPAF, HT-29, PANC-1, Suit2, and S2-013 as 
described previously (Sutherlin, M.E., Nishimori, I., Caffi-ey, T., et al. Expression of three 
UDP-N-acetyl-alpha-D galactosamine:polypeptide GalNAc N-acetylgalactosaminyl- 
transferases in adenocarcinoma cell lines. Cancer Res 57:4744-4748, 1997). Twenty five 
ptg of total RNA was subjected to electrophoresis on a 1 % denaturing agarose gel and 
transferred to nitrocellulose as described previously (Sutherlin, M.E., Nishimori, I., Caffrey, 
T., et al. Expression of three UDP-N-acetyl-alpha-D galactosamine:polypeptide GalNAc N- 
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acetylgalactosaminyltransferases in adenocarcinoma cell lines. Cancer Res 57:4744-4748, 
1997). Human Multiple Tissue northern blots, MTNI and MTNH, were obtained from 
Clontech. The soluble expression construct was used as probe. The probe was labeled by 

^ random priming using aP^2jCTP(Amersham) and an oligo labeling kit (Pharmacia). The 
blots were probed overnight at 42 °C as previously described (Bennett. E.P., Hassan, H. and 
Clausen, H. cDNA cloning and expression of a novel human UDP-N-acetyl-alpha-D- 
galactosamine. Polypeptide N-acetyl-galactosaminyl-transferase, GaINAc-T3. J. Biol. 
Chem. 271:17006-17012, 1996), washed 2x10 min at RT with 2 x SSC, 1% Na,PA, 2 x 
20 min at 65°C with 0.2 x SSC, 1 % SDS, \% Na^P^O, and once 10 min with 0.2 x SSC at 
RT ("preferred hybridization conditions"). 

6.5. ANALYSIS OF DNA POT YMORPHTSM OF THF p ^ Gal-T5 OFNIT 

Primer pairs EBER 1320 (5'-CAGCGAGGTTCTAGAGTTTCC-3') (SEQ 
^5 ID NO:6) and EBER 1321 (5'-GAAATCCACGCCAGAATGTCG-3') (SEQ ID NO:7) for 
amplification of the entire coding sequence have been used for PCR amplification of exon 
1. The PCR product was subcloned and the sequence of 10 clones containing the 
appropriate insert was deteimined assuring that both alleles of each individual are 
characterized. 

20 

6.6. ANTIBODTES TO p.^Cal-TS 
An anti-p3GaI-T5 mononclonal antibody, UH9, was prepared by 
immunizing mice with a purified P3Gal-T5 preparation that gave a single band of 
approximately 35,000 on SDS-PAGE Coomassie stained gel. Balb/c mice were immunized 

25 with one subcutaneous or intraperitoneal injection of 10 ^1 undematured protein in Freunds 
complete adjuvant, followed by two injections with Freunds incomplete adjuvant, and 
finally an intravenous booster without adjuvant. Eyebleeds were taken 7 days after third 
inmiunization, and the titer and specificity of anti-p3Gal-T5 antibodies was evaluated. 
Fusion to NS-1 and the cloning procedure was as described in White et al.. Biochemistry 

3 0 29:2740 (1990). The mononclonal antibody UH9 was selected for reactivity with unfixed 
cells and/or tissues, as well as ability to immunoprecipitate P3Gal-T5 activity. Hybridomas 
were selected by three criteria: (i) differential reactivity in ELISA assays with purified 
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recombinant enzymes; (ii) immunocytology on Sf9 cells two days after infection with 
Baculovirus containing pSGal-transferases, p3Gal-Tl, -T2, -T3, -T4, and -T5; and (iii) 
differential immunoprecipitation of active recombinant enzymes. 

ELISA analysis was performed as described by White et al. (Id.), using 

5 

purified recombinant p3Gal-Tl, -T2, and -T5, using an initial antigen concentration of 10 
tig/ml. 

The immunocytology assay was performed by washing trypsinized cells 
twice in PBS and air drying the washed cells onto coverslides. Dried slides were fixed in 
100% ice cold acetone for 10 min, dried, and incubated with monoclonal anti-P3Gal-T5 
antibody for 1 hour. After washing with PBS, slides were incubated with FITC-conjugated 
rabbit anti-mouse IG for 30 minutes, washed with PBS and mounted in glycerol and 
analyzed by microscopy. 

Immunoprecipitation of recombinant human p3Gal-transferases was 
_ - performed as follows. Secreted forms of human p3Gal-transferases were expressed in Sf9 
cells and media were harvested three days post-infection and used as enzyme source. 
Protein G Sepharose was saturated sequentially with rabbit anti-mouse IgG and monoclonal 
antibodies as culture supematants. A 5% suspension of Protein G beads was added to Sf9 
medium containing either GalNAc-Tl, -T2, -T3 or -T4. After incubation for 1 hour at 4 
2 Q degrees C, beads were washed in PBS, and resuspended in 25 mM Tris (pH 7.4), 0.25% 
Triton X-100. pSGal-transferase activities were measured in the supematants and the 
washed pellets. UH9 selectively immunoprecipitated p3Gal-T5 activity but not p3Gal-Tl or 
-T2 activity. 

Western blot analysis with purified recombinant enzymes was also 

2 5 performed. It proved difficult to select antibodies reactive with both the native and the 

denatured P3Gal-T5 enzyme. The antibody UH9 is therefore likely to be directed to a 
conformational epitope, and to detect the native conformation of p3Gal-T5. Another 
antibody, designated UHIO, only reacted with denatured P3Gal-T5 as evidenced by ability 
to westem blot This antibody did stain insect cells infected with pVL-P3Gal-T5-fiill and 

3 Q pAcGP67-P3Gal-T5-sol, but it did not stain CHO cells stably transfected with the p3Gal-T5 

expression constructs or various epithelial cell lines and tissues. Furthermore, UHIO did 
not immunoprecipitate P3Gal-T5 enzyme activity. 
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To correlate immunoreactivity with enzyme activity, transfected cells 
expressing soluble P3Gal-T5 were trypsinized and plated in 96 well plates. Two rounds of 
screening and cloning by limiting dilution using immunoreactivity with UH9 were 
g performed and clones achieving over 50 % positive cells were selected and tested for level 
of secreted enzyme in supernatant of confluent cultures. The intensity of immunoreactivity 
by the cytology assay correlated in all cases with level of p3Gal-T5 enzyme activity found 
in spent media from clones. 

10 ""^^ invention described and claimed herein is not to be limited in scope by 

the specific embodiments herein disclosed since these embodiments are intended as 
illustration of several aspects of the invention. Any equivalent embodiments are intended to 
be within the scope of this invention. Indeed, various modifications of the invention in 
addition to those shown and described herein will become apparent to those skilled in the 
art from the foregoing description. Such modifications are also intended to fall within the 
scope of the appended claims. Throughout this appHcation various references are cited, the 
contents of each of which is hereby incorporated by reference into the present apphcation in 
its entirety. 



20 



25 



30 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid encoding UDP- galactose: p- A^-acetylglucosamine 
pi, 3-galactosykransferase (P3Gal-T5). 

5 

2. The isolated nucleic acid as defined in claim 1 , wherein said nucleic acid is 

DNA. 

3. The isolated nucleic acid as defined in claim 2, wherein said DNA is cDNA. 

10 

4. An isolated nucleic acid as defined in claim 2, wherein said DNA is genomic 

DNA. 

5. An isolated nucleic acid as defined in claim 1, wherein said nucleic acid 
comprises the nucleotide sequence of nucleotides 1-930 as set forth in Figure 1 or sequence- 
conservative or function-conservative variants thereof. 

6. An isolated nucleotide sequence comprising nucleotides 73-930 as set forth 
in Figure 1, 

20 

7. An isolated nucleotide sequence comprising nucleotides selected fi-om the 
group consisting of nucleotides 1-200; nucleotides 201-400; nucleotides 401-600; 
nucleotides 601-800; and nucleotides 801-930 of claim 1 that hybridizes to a nucleic acid 
vmder stringent conditions. 

25 

8. An isolated nucleic acid which hybridize under conditions of high stringency 
with the nucleic acid having the sequence of nucleotides 1-930 as set fourth in Figure 1 . 

9. A nucleic acid vector comprising a nucleic acid sequence encoding p3Gal- 

30 

T5 or fragments thereof 
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10. A vector as defined in claim 9, wherein said sequence comprises the 
nucleotide sequence of nucleotides 1-930 as set forth in Figure 1 or sequence-conservative 
or function-conservative variants thereof 

S 11. The vector as defined in claim 1 0, wherein said sequence encoding p3Gal-T5 

is operably linked to a transcriptional regulatory element. 

12. A nucleic acid vector comprosing the nucleotide sequence of claim 5. 

^° 13. A nucleic acid vector comprosing the nucleotide sequence of claim 6. 

14. A host cell comprising a vector as defined in claim 9. 

15. A host comprising a vector as defined in claim 11, wherein said cell is stably 
transfected with said vector. 



16. The host cell as defined in claim 14, wherein said cell is stably transfected 
with said vector. 

20 

17. The host cell as defined in claim 13, wherein said cell produces 
enzymatically active p3Gal-T5. 

18. The host cell as defined in claim 13, wherein said cell is selected from the 
group consisting of bacterial, yeast, insect, avian, and mammalian cells. 

2 5 

19. The host cell as defmed in claim 17, wherein said cell is selected from the 
group consisting of bacterial, yeast, insect, avian, and mammalian cells. 

2 Q 20. A cell as defined in claim 1 9, wherein said cell is Sf9. 

21 . A cell as defmed in claim 19, wherein said cell is CHO. 



46 - 



BNSDOCID: <WO__00a9SSaA1_l > 



wo 00/29558 



PCTAJS99/26807 



22. A host cell comprising the nucleic acid vector of claim 12, 

23. A host cell comprising the nucleic acid vector of claim 13. 

5 24. A method for producing p3Gal-T5 polypeptides, which comprises: 

(i) introducing into a host cell an isolated DNA molecule encoding a 
human P3Gal-T5, or a DNA construct comprising a DNA sequence 
encoding p3Gal-T5; 

(ii) growing the host cell under conditions suitable for human P3Gal-T5 
■^^ expression; and 

(iii) isolating p3Gal-T5 produced by the host cell. 

25. An isolated UDP- galactose: P- A^-acetylglucosamine pi , 3- 
galactosyltransferase (p3Gal-T5) polypeptide. 

15 



20 



25 



30 
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[received by the International Bureau on 03 April 2000 (03 04 00)- 
original claims 1-21, 24 and 25 amended; new claims 26-37 added- 
rernaimng claims unchanged (5 pages) 

1 . An isolated nucleic acid of less than 1 0,000 nucleotides or a 

g complement thereof comprising a nucleotide sequence encoding UDP-galactose: ^-N- 
acetylglucosamine pl,3galactosyltransferase (p3Gal-T5) (SEQ ID NO:9). 

2. The isolated nucleic acid of claim 1 which is DNA. 

10 3. The isolated DNA of claim 2 which is cDNA or genomic DNA. 

4. The isolated nucleic acid of claim 1 which is RNA. 

5. An isolated nucleic acid of less than 10,000 nucleotides or a 
complement thereof comprising a nucleotide sequence encoding P3Gal-T5 (SEQ ID N0:9), 
or a function-conservative variant thereof. 

6. An isolated nucleic acid of less than 1 0,000 nucleotides or a 
complement thereof comprising nucleotides 73-930 as set forth in Figure 1, or a sequence- 

2 Q conservative or a function-conservative variant thereof 

7. An isolated nucleic acid of less than 1 0,000 nucleotides or a 
complement thereof consisting of at least 20 contiguous nucleotides selected from 
nucleotides 1-115 and 428-101 1 of SEQ ID NO:8. 

25 

8. An isolated nucleic acid of less than 1 0,000 nucleotides or a 
complement thereof which hybridizes under preferred hybridization conditions with a 
nucleic acid consisting of at least 20 contiguous nucleotides selected from nucleotides 1-115 
and 428-101 1 of SEQ ID NO:8. 

30 

9. A nucleic acid vector of less than 50,000 nucleotides comprising a 
first nucleotide sequence identical to or derived from a second nucleotide sequence 
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encoding P3Gal-T5 (SEQ ID NO:9) or a fragment thereof having at least 15 amino acid 
residues. 

10. A nucleic acid vector of less than 10,000 nucleotides comprising a 

5 

first nucleotide sequence identical to or derived from a second nucleotide sequence 
encoding P3Gal-T5 (SEQ ID NO:9) or a fragment thereof having at least 120 amino acid 
residues. 

^ ^ 11. The vector of Claim 1 0, wherein the first nucleotide sequence is 

operably linked to a transcriptional regulatory element. 

12. A nucleic acid vector of less than 50,000 nucleotides comprising the 
nucleotide sequence of Claim 5. 

15 

13. A nucleic acid vector of less than 50,000 nucleotides comprising the 
nucleotide sequence of Claim 6. 

14. A host cell comprising the nucleic acid vector of Claim 10. 

20 

15. A host cell stably transfected with the nucleic acid vector of Claim 

11. 



16. The host cell of Claim 14, wherein the nucleic acid vector is stably 

25 transfected. 

17. The host cell of Claim 14 which produces P3Gal-T5 enzymatic 

activity. 

3 0 18. The host cell of Claim 1 4 which is selected from the group consisting 

of a bacterial cell, a yeast cell, an insect cell, an avian cell and a mammalian cell. 
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1 9. The host cell of Claim 1 7 which is selected from the group consisting 
of a bacterial cell, a yeast cell, an insect cell, an avian cell and a mammalian cell. 

5 20. The insect cell of Claim 1 9 which is a Spodoptera frugiperda cell. 

21. The mammalian Cell of claim 19 which is a Chinese hamster ovary 

cell. 

10 22. A host cell comprising the nucleic acid vector of Claim 12. 

23. A host cell comprising the nucleic acid vector of Claim 13. 

24. A method for producing a p3Gal-T5 polypeptide (SEQ ID NO:9), a 
^3 fragment thereof having at least 30 amino acid residues, or a function-conservative variant 

of the polypeptide or fragment, which comprises: 

(i) introducing into a host cell a nucleic acid encoding the p3Gal-T5 
polypeptide, fragment or variant; 

(ii) growing the host cell under conditions suitable for expression of the 
2 Q nucleic acid introduced; and 

(iii) isolating the p3Gal-T5 polypeptide, fragment or variant expressed by 

the host cell. 



25 



25. An isolated p3Gal-T5 polypeptide (SEQ ID NO:9). 

26. An isolated p3Gal-T5 polypeptide (SEQ ID NO:9) or a fragment 
thereof having at least 5 amino acid residues. 



27. An isolated P3Gal-T5 polypeptide (SEQ ID NO:9), a fragment 
3 0 thereof having at least 50 amino acid residues, or a function-conservative variant of the 
polypeptide or fragment. 
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28. An isolated P3Gal-T5 polypeptide comprising or consisting of the 
amino acid sequence as set forth in SEQ ID NO:9. 

^ 29. An isolated derivative or fragment of the polypeptide of Claim 28 

consisting of at least 10 contiguous amino acids of the p3Gal-T5 polypeptide. 

30. The derivative or fragment of Claim 29 which displays one or more 
functional activities of the p3Gal-T5 polypeptide. 

10 

3 1 . The derivative or fragment of Claim 29 which is capable of 
immunospecific binding to an antibody raised against a p3Gal-T5 polypeptide. 

32. A fusion protein comprising a fragment of the p3Gal-T5 polypeptide 
of Claim 28 consisting of at least 10 contiguous amino acids of the p3Gal-T5 polypeptide 
fused by a covalent bond to an amino acid sequence of a second protein, which second 
protein is not a P3Gal-T5 polypeptide. 

33. An isolated galactosyltransferase capable of transferring galactose 
2 Q p 1-3 to an acceptor substrate selected from the group consisting of GalNAcp 1-3 Gala 1 - 

4Gaipi-4Glcpl-Cer, Glcpl-Cer, GalNAcp l-4Gaipi.4Glcpi-Cer, GlcNAcpl-3Galpl- 
4Glcpl-Cer, and GlcNAcpi-3Gaipi-4GlcNAcpi-3Galpl-4Glcpl-Cer. 

34. An isolated p3Gal-T5 polypeptide (SEQ ID NO:9), a fragment 

2 5 thereof or a function-conservative variant of the polypeptide or fragment which is capable 

of transferring galactose pl-3 to GalNAcp 1-3 Gala l-4Galpl-4Glcpi-Cer. 

35. A purified antibody or an antigen-binding fragment or derivative 
thereof capable of immunospecific binding to the polypeptide of any one of Claims 25-28 

3 Q and not to another galactosyltransferase. 
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36. The antibody of Claim 35 which is selected from the group consisting 
of a polyclonal antibody, a monoclonal antibody, a chimeric antibody and a humanized 
antibody. 

37. The monoclonal antibody of Claim 36 which is UH9. 
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pSGal-TS 



'IB CCACCTCAGCCTCCTAGCATAAAP.CTAGACACATCCTCATGCTTTTGAGGTCTP-J^TCATTGGATTTTGTTCCTTTCAG -1 

I MAFPKMRLMYIC LLVLGALCLYYSMb? 26 

1 ATGGCTTTCCCGAAGATGAGAT7GATGTATATCTGCCTTCTGGTTCTGGGGGCTCTTTGTTTGTATTATAGCATGAA.C IS 



27 SLNPFKEQSFVYKKDGKFLKLPDTDC 52 

79 AGTCTAAATCCTTTC AAAGAACAGTCCTTTGTTTACAAGAAAGACGGGAACTTCCTTAAGCTCCCAGATACAGACTGC 156 

53 RQTPPFLVLLVTSSHKQLAERMAIRQ 78 

157 AGGCAGACACCTCCCTTCCTCGTCCTGCTGGTGRCCTCATCCCACAAACAGTTGGCTGAGCGCATGGCCATCCGGCAG 234 

79 TWGKERTVKGKQLKTFFLLGTTSSAA 104 

23 5 ACGTGGGGGAAAGAGAGGACGGTGAAGGGAAAGCAGCTGAAGACATTCTTCCTCCTGGGGACCACCAGCAGTGCAGCG 312 

105 ETKEVDQESQRHGDI IQKDFLDVYY N* 130 

313 GAAAC AAAAGAGGTGG ACCAGG AGAGCCAGCGACACGGGGACATTATCC AG AAGG ATTTCCTAG ACGTCTATTACAAT 390 

131 LTLKTMMGIEWVHRFCP QAAF VMKTD 156 

391 CTGACCCTGAAGACCATGATGGGCATAGA.ATGGGTCCATCGCTTTTGTCCTCAGGCGGCGTTTGTGATGAAAACAGAC 468 

157 SDMFIMVDYLTELLLKKM*?. TTRFFTG . 182 

4 69 TCAGACATGTTCATCAATGTTGACTATCTGACTGA.^CrGCTTCTGAAGAAAAACAGAACAACCAGGTTTTTCACTGGC 54 6 

183 FLKLNEFPIRQPFSKWFVSKSEYPWD 208 

547 TTCTTGAAACTCAATGAGTTTCCCATCAGGCAGCCATTCAGCAAGTGGTTTGTCAGTAAATCTGAATATCCGTGGGAC 624 

209 RYPPFCSGTGYVFSG DVASQVYN*VSK 234 

625 AGGTACCCACCATTCTGCTCCGGCACCGGCTACGTGTTTTCTGGCGACGTGGCGAGTCAGGTGTACAATGTC7CCAAG 702 

235 SVPY IKLEDVFVGLCLERLNIRLEEL 260 

703 AGCGTCCCATACATTAAACTGGAAGACGTGTTTGTGGGGCTCTGCCTCGAAAGGCTGAACATCAGATTGGAGGAGCTC 780 

261 HSQPTFFPGGLRFSVCLFRRIVACHF 286 

781 CACTCCCAGCCGACCTTTTTTCCAGGGGGCTTACGCTTCTCCGTATGCCTCTTCAGGAGGATCGTGGCCTGCCACTTC 858 

287 IKPRTLLDYWQALENSRGEDCPPV* 310 

859 ATCAAGCCTCGGACTCTCTTGGACTACTGGCAGGCTCTAGAGAATTCCCGGGG QGXAG ATTGTCCGCCTGTCTGA , . . 933 



FIGURE 1 
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SEQUENCE LISTING 



<110> Clausen, Henrik 
Amado, Margarita 

<120> UDP -GALACTOSE: BETA-N- ACETYL -GLUCOSAMINE BETA 1 3 
GALACTOSYLTRANSFERASES, BETAGAL-T5 

<130> 7188-157 



<140> 
<141> 



<160> 17 



<170> Patentin Ver . 2.0 

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 1 

cttccttaag ctcccagata c 



<210> 2 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
c400> 2 

gtttccgctg cactgcactg ctggtg 



<210> 3 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 3 

atgtacagtc taaatccttt c 



<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



1 
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<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 4 

tcagacaggc ggacaatctt c 21 



<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 5 

atggctcccg aagatgag 18 



<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 6 

cagcgaggtt ctagagtttc c 21 



<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PRIMER 
<400> 7 

gaaatccacg ccagaatgtc g 21 



<210> 8 
<211> 1011 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (79) . . (1008) 

<400> 8 

ccacctcagc ctcctagcat aaaactagac acatcctcat gcttttgagg tctaatcatt 60 

ggattttgtt cctttcag atg get ttc ccg aag atg aga ttg atg tat ate 111 

Met Ala Phe Pro Lys Met Arg Leu Met Tyr lie 
15 10 
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tgc ctt ctg gtt ctg ggg get ctt tgt ttg tat ttt age atg tac agt 
Cys Leu Leu Val Leu Gly Ala Leu Cys Leu Tyr Phe Ser Met Tyr Ser 
15 20 



25 



40 



lil T f^^ f ^""^ ^^'^ =^9 =ct ccc ttc etc 

Phe Leu Lys Leu Pro Asp Thr Asp Cys Arg Gin Thr Pro Pro Phe Leu 
45 50 55 

gtc ctg ctg gtg acc tea tee cac aaa eag ttg get gag cgc ata ccc 
val Leu Leu Val Thr Ser Ser His Lys Gin Leu Ila Slu A^g Met I^a 

65 70 75 

ri^ t^^ ^''^ 9^9 ^99 acg gtg aag gga aag eag ctg 

He Arg Gin Thr Trp Gly Lys Glu Arg Thr Val Lys ciy LyI Gin LeS 
80 85 90 

Lvf tZ 111 ll"" f^^ ^^'^ ^9C agt gea geg gaa aea aaa 

Lys Thr Phe Phe Leu Leu Gly Thr Thr Ser Ser Ala Ala Glu Thr Lys 
95 100 105 

Ilu v=? Hf^ "^^3 '"^'^ 9gg gac att ate eag aag gat 

Glu val Asp Gin Glu Ser Gin Arg His Gly Asp He He Gin Lys Asp 

115 120 

ttc eta gac gte tat tac aat ctg ace ctg aag acc atg atg ggc ata 
Phe Leu Asp val Tyr Tyr Asn Leu Thr Leu Lys Thr Me? Me? lly xH 
125 130 

atu S vli Hf ^^"^ 5^=9 9tg atg aaa aca 

Glu Trp Val His Arg Phe Cys Pro Gin Ala Ala Phe Val Met Lys Thr 

■^^^ 145 



150 



155 



gac tea gac atg ttc ate aat gtt gac tat ctg act gaa ctg ctt ctq 
ASP ser Asp Met Phe He Asn Val Asp Tyr Leu Thr llu Leu ^eu Leu 



170 



aag aaa aac aga aca acc agg ttt ttc act ggc ttc ttg aaa etc aat 
Lys Lys Asn Arg Thr Thr Arg Phe Phe Thr oiy Phe Leu Lys llu lln 



159 



eta aat cct ttc aaa gaa eag tec ttt gtt tac aag aaa gac ggg aac 207 
Leu Asn Pro Phe Lys Glu Gin Ser Phe Val Tyr Lys Lys Isp g!? Isn 
30 35 



180 185 



llu Phi Pr^ T?"" n^^ """"^ tct 

Glu Phe Pro He Arg Gin Pro Phe Ser Lys Trp Phe Val Ser Lys Ser 
190 195 



200 



gaa tat ecg tgg gac agg tac cea cea ttc tgc tec ggc acc ggc tac 
Glu Tyr Pro Trp Asp Arg Tyr Pro Pro Phe Cys Ser Gly Thr Gly Tyr 



210 



215 



?a? m 1 ^^"^ ^""^ ""^^ tac aat gtc tec aag age 

val Phe Ser Gly Asp Val Ala Ser Gin Val Tyr Asn Val Ser Lys Ser 

225 230 235 



255 



303 



351 



399 



447 



495 



543 



591 



639 



687 



735 



783 
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gtc cca tac att 
Val Pro Tyr lie 



agg ctg aac ate 
Arg Leu Asn lie 
255 

cca ggg ggc tta 
Pro Gly Gly Leu 
270 

tgc cac ttc ate 
Cys His Phe lie 
285 

gag aat tec egg 
Glu Asn Ser Arg 
300 



aaa ctg gaa 
Lys Leu Glu 
240 

aga ttg gag 
Arg Leu Glu 



cgc ttc tec 
Arg Phe Ser 



aag cct egg 
Lys Pro Arg 
290 

ggg gaa gat 
Gly Glu Asp 
305 



gac gtg ttt gtg 
Asp Val Phe Val 
245 

gag etc eac tec 
Glu Leu His Ser 
260 

gta tgc etc ttc 
Val Cys Leu Phe 
275 

act etc ttg gac 
Thr Leu Leu Asp 



tgt ceg ect gtc 
Cys Pro Pro Val 
310 



ggg etc tgc 
Gly Leu Cys 



cag ceg ace 
Gin Pro Thr 
265 

agg agg ate 
Arg Arg lie 
280 

tac tgg cag 
Tyr Trp Gin 
295 

tga 



etc gaa 831 

Leu Glu 

250 

ttt ttt 879 
Phe Phe 



gtg gcc 927 
Val Ala 



get eta 975 
Ala Leu 



1011 



<210> 9 
<211> 310 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Met Ala Phe Pro Lys Met Arg Leu Met Tyr lie Cys Leu Leu Val Leu 
15 10 15 

Gly Ala Leu Cys Leu Tyr Phe Ser Met Tyr Ser Leu Asn Pro Phe Lys 
20 25 30 

Glu Gin Ser Phe Val Tyr Lys Lys Asp Gly Asn Phe Leu Lys Leu Pro 
35 40 45 

Asp Thr Asp Cys Arg Gin Thr Pro Pro Phe Leu Val Leu Leu Val Thr 
50 55 60 

Ser Ser His Lys Gin Leu Ala Glu Arg Met Ala lie Arg Gin Thr Trp 
65 70 75 80 

Gly Lys Glu Arg Thr Val Lys Gly Lys Gin Leu Lys Thr Phe Phe Leu 
85 90 95 

Leu Gly Thr Thr Ser Ser Ala Ala Glu Thr Lys Glu Val Asp Gin Glu 
100 105 110 

Ser Gin Arg His Gly Asp lie lie Gin Lys Asp Phe Leu Asp Val Tyr 
115 120 125 

Tyr Asn Leu Thr Leu Lys Thr Met Met Gly lie Glu Trp Val His Arg 
130 135 140 

Phe Cys Pro Gin Ala Ala Phe Val Met Lys Thr Asp Ser Asp Met Phe 
145 150 155 160 
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He Asn Val Asp Tyr Leu Thr Glu Leu Leu Leu Lys Lys Asn Arg Thr 
165 

Thr Arg Phe Phe Thr Gly Phe Leu Lys Leu Asn Glu Phe Pro lie Arg 
180 185 190 



Gin Pro Phe Ser Lys Trp Phe Val Ser Lys Ser Glu Tyr Pro Trp Asp 
195 200 205 

Arg Tyr Pro Pro Phe Cys Ser Gly Thr Gly Tyr Val Phe Ser Gly Asp 
210 215 220 

Val Ala Ser Gin Val Tyr Asn Val Ser Lys Ser Val Pro Tyr He Lys 

230 235 240 

Leu Glu Asp val Phe Val Gly Leu Cys Leu Glu Arg Leu Asn He Arg 
245 250 255 

Leu Glu Glu Leu His Ser Gin Pro Thr Phe Phe Pro Gly Gly Leu Arg 

255 270 

Phe Ser Val Cys Leu Phe Arg Arg He Val Ala Cys His Phe He Lys 
275 280 285 

Pro Arg Thr Leu Leu Asp Tyr Trp Gin Ala Leu Glu Asn Ser Arg Gly 

295 

Glu Asp Cys Pro Pro Val 
305 



<210> 10 

<211> 422 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Leu Gin Trp Arg Arg Arg His Cys Cys Phe Ala Lys Met Thr Trp 

5 10 15 

Asn Ala Lys Arg Ser Leu Phe Arg Thr His Leu He Gly Val Leu Ser 
20 25 30 

Leu Val Phe Leu Phe Ala Met Phe Leu Phe Phe Asn His His Asp Trp 
35 40 45 

Leu Pro Gly Arg Ala Gly Phe Lys Glu Asn Pro Val Thr Tyr Thr Phe 
, ^° 55 60 



Arg Gly Phe Arg Ser Thr Lys Ser Glu Thr Asn His Ser Ser Leu Arg 
" 75 80 

Asn He Trp Lys Glu Thr Val Pro Gin 



85 90 



Thr Leu Arg Pro Gin Thr Ala 

95 



Thr Asn Ser Asn Asn Thr Asp Leu Ser Pro Gin Gly Val Thr Gly Leu 
100 105 110 



5 



BNSDOCIO: <WO 0Q29558A l_l_> 



wo 00/29558 



Glu Asn Thr Leu Ser 
115 

Gly His Pro Asn Ser 
130 

Lys Cys Gin Glu Lys 
145 

Pro Gly Gin lie Glu 
165 

Glu Ser Leu Ala Pro 
180 

Leu Ser lie Lys Leu 
195 

Ser Arg Gin Tyr His 
210 

Tyr Asn Leu Thr lie 
225 

Tyr Cys Pro His lie 
245 

Val Asn Thr Glu Tyr 
260 

Pro Arg His Asn Tyr 
275 

Asn Arg Asn Lys Asp 
290 

Ser Glu Arg Tyr Pro 
305 

Gly Asp Leu Ala Glu 
325 

Leu His Leu Glu Asp 
340 

lie Asp Pro Val Pro 
355 

Val Ser Tyr Ser Ser 
370 

Phe Gin Pro Ser Glu 
385 

Lys His Asn Ala Cys 
405 



Ala Asn Gly Ser lie 
120 

Tyr His Phe Lys Tyr 
135 

Ser Pro Phe Leu lie 
150 

Ala Arg Arg Ala lie 
170 

Gly lie Gin lie Thr 
185 

Asn Gly Tyr Leu Gin 
200 

Asp lie lie Gin Gin 
215 

Lys Thr Leu Met Gly 
230 

Pro Tyr Val Met Lys 
250 

Leu lie Asn Lys Leu 
265 

Phe Thr Gly Tyr Leu 
280 

Ser Lys Trp Tyr Met 
295 

Val Phe Cys Ser Gly 
310 

Lys lie Phe Lys Val 
330 

Val Tyr Val Gly lie 
345 

Pro Pro Asn Glu Phe 
360 

Cys Lys Tyr Ser His 
375 

Leu lie Lys Tyr Trp 
390 

Ala Asn Ala Ala Lys 
410 



Tyr Asn Glu Lys Gly 
125 

He He Asn Glu Pro 
140 

Leu Leu He Ala Ala 
155 

Arg Gin Thr Trp Gly 
175 

Arg lie Phe Leu Leu 
190 

Arg Ala He Leu Glu 
205 

Glu Tyr Leu Asp Thr 
220 

Met Asn Trp Val Ala 
235 

Thr Asp Ser Asp Met 
255 

Leu Lys Pro Asp Leu 
270 

Met Arg Gly Tyr Ala 
285 

Pro Pro Asp Leu Tyr 
300 

Thr Gly Tyr Val Phe 
315 

Ser Leu Gly He Arg 
335 

Cys Leu Ala Lys Leu 
350 

Val Phe Asn His Trp 
365 

Leu He Thr Ser His 
380 

Asn His Leu Gin Gin 
395 

Glu Lys Ala Gly Arg 
415 
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Thr 

Glu 

Glu 
160 

Asn 

Gly 

Glu 

Tyr 

Thr 
240 

Phe 

Pro 

Pro 

Pro 

Ser 
320 

Arg 

Arg 

Arg 

Gin 

Asn 
400 

Tyr 
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Arg His Arg Lys Leu His 
420 

<210> 11 
<211> 326 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Met Ala Ser Lys Val Ser Cys Leu Tyr Val Leu Thr Val Val Cys Trp 

^ 10 15 



Ala Ser Ala Leu Trp lyr Leu Ser lie Thr Arg Pro Thr Ser Ser Tyr 
2° 25 30 

Thr Gly Ser Lys Pro Phe Ser His Leu Thr Val Ala Arg Lys Asn Phe 
35 40 



45 



Thr Phe Gly Asn He Arg Thr Arg Pro He Asn Pro His Ser Phe Glu 

Phe Leu He Asn Glu Pro Asn Lys Cys Glu Lys Asn He Pro Phe Leu 

75 80 

Val He Leu He Ser Thr Thr His Lys Glu Phe Asp Ala Arg Gin Ala 
85 90 95 

He Arg Glu Thr Trp Gly Asp Glu Asn Asn Phe Lys Gly He Lys He 
100 105 

Ala Thr Leu Phe Leu Leu Gly Lys Asn Ala Asp Pro Val Leu Asn Gin 

120 125 

Met val Glu Gin Glu Ser Gin He Phe His Asp He He Val Glu Asp 

Phe He Asp Ser Tyr His Asn Leu Thr Leu Lys Thr Leu Met Gly Met 



160 



Arg Trp Val Ala Thr Phe Cys Ser Lys Ala Lys Tyr Val Met Lys Thr 
165 170 

Asp ser Asp He Phe Val Asn Met Asp Asn Leu He Tyr Lys Leu Leu 
180 185 190 

Lys Pro Ser Thr Lys Pro Arg Arg Arg Tyr Phe Thr Gly Tyr Val He 
195 200 205 

Asn Gly Gly Pro He Arg Asp Val Arg Ser Lys Trp Tyr Met Pro Arg 

215 220 

Asp Leu Tyr Pro Asp Ser Asn Tyr Pro Pro Phe Cys Ser Gly Thr Gly 



235 240 



Tyr He Phe Ser Ala Asp Val Ala Glu Leu He Tyr Lys Thr Ser Leu 



250 255 
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His Thr Arg Leu 
260 

Arg Lys Leu Gly 
275 

Lys Met Ala Tyr 
290 

Gin lie Ser Pro 
305 

Lys Lys His Leu 



Leu His Leu Glu 



lie His Pro Phe 
280 

Ser Leu Cys Arg 
295 

Glu Glu Met His 
310 

Arg Cys 
325 



Asp Val Tyr Val 
265 

Gin Asn Ser Gly 



Tyr Arg Arg Val 
300 

Arg lie Trp Asn 
315 



Gly Leu Cys Leu 
270 

Phe Asn His Trp 
285 

lie Thr Val His 



Asp Met Ser Ser 
320 



<210> 12 
<211> 331 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Ala Ser Ala Leu Trp Thr Val Leu Pro Ser Arg Met Ser Leu Arg 
15 10 15 

Ser Leu Lys Trp Ser Leu Leu Leu Leu Ser Leu Leu Ser Phe Phe Val 
20 25 30 

Met Trp Tyr Leu Ser Leu Pro His Tyr Asn Val lie Glu Arg Val Asn 
35 40 45 

Trp Met Tyr Phe Tyr Glu Tyr Glu Pro lie Tyr Arg Gin Asp Phe His 
50 55 60 

Phe Thr Leu Arg Glu His Ser Asn Cys Ser His Gin Asn Pro Phe Leu 
65 70 75 80 

Val lie Leu Val Thr Ser His Pro Ser Asp Val Lys Ala Arg Gin Ala 
85 90 95 

lie Arg Val Thr Trp Gly Glu Lys Lys Ser Trp Trp Gly Tyr Glu Val 
100 105 110 

Leu Thr Phe Phe Leu Leu Gly Gin Glu Ala Glu Lys Glu Asp Lys Met 
115 120 125 

Leu Ala Leu Ser Leu Glu Asp Glu His Leu Leu Tyr Gly Asp lie lie 
130 135 140 

Arg Gin Asp Phe Leu Asp Thr Tyr Asn Asn Leu Thr Leu Lys Thr lie 
145 150 155 160 

Met Ala Phe Arg Trp Val Thr Glu Phe Cys Pro Asn Ala Lys Tyr Val 
165 170 175 

Met Lys Thr Asp Thr Asp Val Phe lie Asn Thr Gly Asn Leu Val Lys 
180 185 190 
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Tyr Leu Leu Asn Leu Asn His Ser Glu Lys Phe Phe Thr Gly Tyr Pro 

205 

Leu lie ASP Asn Tyr Ser Tyr Arg Gly Phe Tyr Gin Lys Thr His He 

•^^^ 220 

ser Tyr Gin Glu Tyr Pro Phe Lys Val Phe Pro Pro Tyr Cys Ser Gly 

235 240 
Leu Gly lyr He Met Ser Arg Asp Leu Val Pro Arg He Tyr Glu Met 

250 255 

Met Gly His val Lys Pro He Lys Phe Glu Asp Val Tyr Val Gly He 

^tin 270 

cys Leu Asn Leu Leu Lys Val Asn He His He Pro Glu Asp Thr Asn 

280 285 

Leu Phe Phe Leu Tyr Arg He His Leu Asp Val Cys Gin Leu Arg Arg 



300 



val He Ala Ala His Gly Phe Ser Ser Lys Glu He He Thr Phe Trp 



315 



320 



Gin Val Met Leu Arg Asn Thr Thr Cys His Tyr 



325 

<210> 13 
<211> 378 
<212> PRT 

<213> Homo sapiens 
<400> 13 

Met Gin Leu Arg Leu Phe Arg Arg Leu Leu Leu Ala Ala Leu Leu Leu 

5 10 15 

val He val Trp Thr Leu Phe Gly Pro Ser Gly Leu Gly Glu Glu Leu 
^° 25 30 

Leu Ser Leu Ser Leu Ala Ser Leu Leu Pro Ala Pro Ala Ser Pro Gly 



40 45 



Pro Pro Leu Ala Leu 
50 



Pro Arg Leu Leu He Pro Asn Gin Glu Ala Cys 



60 



a 
80 



ser Gly Pro Gly Ala Pro Pro Phe Leu Leu He Leu Val Cys Thr Al 

70 75 8 

Pro. Glu Asn Leu Asn Gin Arg Asn Ala He Arg Ala Ser Trp Gly Gly 
85 90 95 

Leu Arg Glu Ala Arg Gly Leu Arg Val Gin Thr Leu Phe Leu Leu Gly 

110 

Glu Pro Asn Ala Gin His Pro Val Trp Gly Ser Gin Gly Ser Asp Leu 



125 
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Ala Ser Glu Ser Ala Ala Gin Gly Asp He Leu Gin Ala Ala Phe Gin 
130 135 140 

Asp Ser Tyr Arg Asn Leu Thr Leu Lys Thr Leu Ser Gly Leu Asn Trp 
145 150 155 160 

Ala Glu Lys His Cys Pro Met Ala Arg Tyr Val Leu Lys Thr Asp Asp 
165 170 175 

Asp Val Tyr Val Asn Val Pro Glu Leu Val Ser Glu Leu Val Leu Arg 
180 185 190 

Gly Gly Arg Trp Gly Gin Trp Glu Arg Ser Thr Glu Pro Gin Arg Glu 
195 200 205 

Ala Glu Gin Glu Gly Gly Gin Val Leu His Ser Glu Glu Val Pro Leu 
210 215 220 

Leu Tyr Leu Gly Arg Val His Trp Arg Val Asn Pro Ser Arg Thr Pro 
225 230 235 240 

Gly Gly Arg Gly Arg Val Ser Glu Glu Gin Trp Pro His Thr Trp Gly 
245 250 255 

Pro Phe Pro Pro Tyr Ala Ser Gly Thr Gly Tyr Val Leu Ser Ala Ser 
260 265 270 

Ala Val Gin Leu He Leu Lys Val Ala Ser Arg Ala Pro Leu Leu Pro 
275 280 285 

Leu Glu Asp Val Phe Val Gly Val Ser Ala Arg Arg Gly Gly Leu Ala 
290 295 300 

Pro Thr Gin Cys Val Lys Leu Ala Gly Ala Thr His Tyr Pro Leu Asp 
305 310 315 320 

Arg Cys Cys Tyr Gly Lys Phe Leu Leu Thr Ser His Arg Leu Asp Pro 
325 330 335 

Trp Lys Met Gin Glu Ala Trp Lys Leu Val Gly Gly Ser Asp Gly Glu 
340 345 350 

Arg Thr Ala Pro Phe Cys Ser Trp Phe Gin Gly Val Leu Gly lie Leu 
355 360 365 

Arg Cys Arg Ala He Ala Trp Leu Gin Ser 
370 375 



<210> 14 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cggggtaccg gaaacgatga aatatacaag 3 0 
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<210> 15 

<211> 40 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ggcggatcca ggcagatcac agccaagaga acccaaaacg 

<210> 16 
<211> 40 
<212> DNA 
<213> Homo sapiens 

<400> 16 

gcggatccca ggcagatcac agccaagaga acccaaaacg 

<210> 17 
<211> 41 
<212> DMA 
<213> Homo sapiens 

<400> 17 

gcggatcccc aggcagatca cagccaagag aacccaaaac g 4^ 



40 
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